REMARKS

Claims 1-5 and 7-11 are presented for examination. Applicants respond below to the 
specific rejections raised by the Examiner in the pending Office Action. For the reasons set forth 
below, Applicants respectfully traverse. 

35 U.S.C. §§ 102(e) and 103(a) - Afar et al.

The Examiner has rejected claims 1-5 and 7-11 under 35 U.S.C. § 102(e) or alternatively 
35 U.S.C. § 103(a) over Afar et al. (U.S. Pub. No. 2005/0019870). The Examiner states that
Afar discloses a prostate tumor associated gene (24P4C12) and its encoded protein. The 
Examiner asserts that the 24P4C12 protein, which is 710 amino acids long, is identical to amino 
acids 9-321 of SEQ ID NO: 10. The Examiner states that Afar discloses and claims anti- 
24P4C12 antibodies. 

The Afar reference does not expressly or inherently disclose the claimed antibodies 

In response to Applicants' arguments that the Afar reference does not expressly or 
inherently disclose an antibody which specifically binds to the polypeptide of SEQ ID NO: 10 
since the majority of 24P4C12 does not correspond to SEQ ID NO: 10, the Examiner asserts that
"[b]ased on the Kyte-Doolittle plot, Afar clearly identifies immunogenic regions that are
identical to the immunogenic regions in applicants' SEQ ID NO: 10 and clearly discloses
antibodies to those identical immunogenic regions." Office Action at 5 (emphasis added). To
support this assertion, the Examiner cites page 34, full paragraph 3. See Office Action at 6. No
antibody is disclosed at "page 34, full paragraph 3." Page 34 of US 2005/0019870 only contains
a portion of the sequence listing for the application, and there is no disclosure of any specific 
antibody on that page. 

Applicants have reviewed the Afar reference and find no disclosure in Afar that "clearly identifies
immunogenic regions that are identical to the immunogenic regions in applicants' SEQ ID
NO: 10 and clearly discloses antibodies to those identical immunogenic regions." The closest
disclosure Applicants can find is in ¶ [0112], where Afar states that "[t]he amino acid sequence of
the 24P4C12 ... may be used to select specific regions of the 24P4C12 protein for generating
antibodies," and that these regions can be identified using methods known in the art, such as
Kyte-Doolittle analysis. Afar at ¶ [0112] (emphasis added). However, Afar does not disclose
that Afar et al. actually utilized the 24P4C12 sequence to select a specific region for making
antibodies (with one exception discussed below). As a result, Afar does not identify or disclose
any specific hydrophilic or immunogenic regions of the 24P4C12 protein that can or should be
used. Thus, contrary to the Examiner's assertion, Afar does not "clearly identif[y] immunogenic
regions that are identical to the immunogenic regions in applicants' SEQ ID NO: 10 and clearly
disclose[] antibodies to those identical immunogenic regions." At best, Afar suggests making
antibodies to immunogenic regions of the 24P4C12 protein - not a particularly illuminating
disclosure since one cannot make antibodies to non-immunogenic regions. Afar's teachings are
general in nature and cannot be anticipatory because these teachings do not describe the
invention in as full detail as that which is claimed. Absent such teachings, Afar cannot anticipate
the claims.

At best, this is a disclosure of a potential genus of antibodies that one of skill in the art 
could make, and no more. It is not an express or inherent disclosure of any particular species of 
antibodies, much less an antibody as claimed in the present application. Afar simply does not 
disclose a particular species of antibody based on this teaching. Applicants invite the Examiner 
to point to the particular portion of Afar where a particular antibody is disclosed - Applicants are 
aware of only one. 

The only example of an antibody to a specific portion of the 24P4C12 protein disclosed in 
Afar is found in Example 4. Afar teaches a polyclonal antibody to a peptide "corresponding to 
amino acids 1-14 ... of the 24P4C12 protein sequence." Afar at ¶ [0235]. As Applicants have
previously noted, according to the Examiner, amino acids 398-710 of 24P4C12 disclosed in Afar 
are identical to amino acids 9-321 of SEQ ID NO: 10, but not amino acids 1-397. Thus, the 
Examiner has not established that amino acids 1-14 of the 24P4C12 protein sequence are similar 
to SEQ ID NO: 10 of the instant specification. Absent such a showing, there is no basis to assert 
that the antibodies disclosed in Example 4 of Afar anticipate or render obvious the pending
claims. 
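For illustration only, the residue correspondence described above can be checked with a short sketch. It assumes nothing beyond the alignment as characterized in the Office Action (residues 398-710 of 24P4C12 corresponding to residues 9-321 of SEQ ID NO: 10); the function names are hypothetical and are not taken from Afar or from the Office Action.

```python
# Alignment as characterized in the Office Action (per the text above):
# residues 398-710 of 24P4C12 correspond to residues 9-321 of SEQ ID NO: 10.
ALIGNED_24P4C12 = (398, 710)
ALIGNED_SEQ10 = (9, 321)


def map_to_seq10(position):
    """Map a 24P4C12 residue number to SEQ ID NO: 10 numbering.

    Returns None when the residue lies outside the aligned region.
    """
    start, end = ALIGNED_24P4C12
    if not start <= position <= end:
        return None
    return position - start + ALIGNED_SEQ10[0]


def peptide_in_aligned_region(first, last):
    """True only if both ends of the peptide fall inside the aligned region."""
    return map_to_seq10(first) is not None and map_to_seq10(last) is not None


# The Example 4 peptide spans residues 1-14 of 24P4C12.
print(peptide_in_aligned_region(1, 14))    # False
print(map_to_seq10(398), map_to_seq10(710))  # 9 321
```

Under that alignment, the peptide of Example 4 lies entirely outside the region said to be shared with SEQ ID NO: 10.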

In addition, Afar discloses a prophetic example of monoclonal antibodies to the 24P4C12 
protein in Example 9. That example discloses that "[i]n order to generate 24P4C12 monoclonal 
antibodies, a glutathione-S-transferase (GST) fusion protein encompassing a 24P4C12 protein is
synthesized and used as immunogen. Alternatively, 24P4C12 can be conveniently expressed in
293T cells transfected with a CMV-driven expression vector encoding 24P4C12 with a
C-terminal 6xHis and MYC tag (pcDNA3.1/mycHIS, Invitrogen)." Afar at ¶ [0251] (emphasis
added). As noted previously, according to the Examiner's alignment, 397 of the 710 amino acids 
of 24P4C12 are apparently completely different from SEQ ID NO: 10. That means that 56% of 
24P4C12 differs from SEQ ID NO:10. Thus, there is no reason to believe that a monoclonal 
antibody generated by use of the entire 24P4C12 protein would satisfy the limitations of the 
pending claims. 
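The percentage stated above follows from simple arithmetic on the figures in the Examiner's alignment. A minimal sketch, using only the two numbers given in the text (variable names are illustrative):

```python
# Figures taken from the Examiner's alignment as described above.
TOTAL_RESIDUES = 710        # length of 24P4C12
NON_MATCHING_RESIDUES = 397  # residues 1-397, not aligned to SEQ ID NO: 10

matching_residues = TOTAL_RESIDUES - NON_MATCHING_RESIDUES
percent_non_matching = round(100 * NON_MATCHING_RESIDUES / TOTAL_RESIDUES)
percent_matching = round(100 * matching_residues / TOTAL_RESIDUES)

print(matching_residues)     # 313
print(percent_non_matching)  # 56
print(percent_matching)      # 44
```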

In sum, Afar does not disclose any antibodies to any specific region of 24P4C12, other 
than amino acids 1-14. Instead, there is merely a generic disclosure of antibodies to 24P4C12. 
While it is possible that an antibody to 24P4C12 would bind the polypeptide of SEQ ID NO: 10, 
it is not a certainty since more than half of 24P4C12 apparently bears no similarity to SEQ ID 
NO: 10. Mere possibility is not sufficient for inherent anticipation: "Inherency, however, may 
not be established by probabilities or possibilities. The mere fact that a certain thing may
result from a given set of circumstances is not sufficient." M.P.E.P. § 2112 IV (8th ed. 2004),
quoting In re Robertson, 169 F.3d 743, 745 (Fed. Cir. 1999) (emphasis added).

The Examiner attempts to overcome this shortcoming by generating a Kyte-Doolittle plot
some 5 years after the filing of the instant application. This plot is in no way inherent in Afar,
and does not constitute part of the Afar disclosure. Thus, regardless of whether the plot is
accurate or not, a point Applicants have not conceded, it is not a part of the Afar disclosure and
does not transform Afar's generic disclosure of possible antibodies to 24P4C12 into a disclosure
of specific antibodies. As noted previously, the plot shows that there are hydrophilic regions
throughout the entire length of 24P4C12, including amino acids 1-397, which have no apparent
similarity to SEQ ID NO: 10. Thus, the Examiner's post-filing Kyte-Doolittle plot, even if
accurate, discloses nothing more specific regarding the claimed antibodies beyond Afar's generic
disclosure of generating antibodies. Absent the plot, Afar generically discloses possible
antibodies to immunogenic portions of 24P4C12, some of which antibodies may, but do not
necessarily, bind the polypeptide of SEQ ID NO: 10. In view of the plot, Afar teaches exactly the
same thing - Afar generically discloses possible antibodies to immunogenic portions of
24P4C12, some of which antibodies may, but do not necessarily, bind the polypeptide of SEQ ID
NO: 10. A generic disclosure of possible antibodies to an entire protein is not a disclosure of
antibodies to a specific region, and the Examiner's plot does not change this fact. The disclosed
generic teachings toward generating antibodies to 24P4C12 clearly do not inherently possess the
claimed features of the pending claims.
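By way of illustration only, a Kyte-Doolittle hydropathy profile of the general kind referred to above can be sketched as follows. The sketch uses the published Kyte-Doolittle hydropathy scale; the window length of 9, the zero threshold, the function names, and the short test peptide are assumptions chosen for illustration and are not taken from Afar or from the Examiner's plot.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic, negative = hydrophilic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=9):
    """Return the mean hydropathy of every full-length sliding window."""
    sequence = sequence.upper()
    if window < 1 or window > len(sequence):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[residue] for residue in sequence]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(sequence) - window + 1)
    ]


def hydrophilic_window_starts(sequence, window=9, threshold=0.0):
    """Return 1-based start positions of windows whose mean falls below threshold."""
    profile = hydropathy_profile(sequence, window)
    return [i + 1 for i, score in enumerate(profile) if score < threshold]


# Hypothetical peptide: a lysine-rich stretch followed by an isoleucine-rich stretch.
example = "KKKKKKKKKIIIIIIIII"
print(hydrophilic_window_starts(example))
```

A profile of this kind only reports where hydrophilic windows fall along a sequence; it does not by itself select any one region for use as an immunogen.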



The Afar reference is not available as prior art 

The Examiner stated in the previous Office Action that "[a]ccording to M.P.E.P. §
715.II.B, an affidavit or declaration under 37 C.F.R. 1.131 is not appropriate in situations where
the reference U.S. patent or U.S. patent application publication claims the same patentable 
invention." Office Action at 9. Applicants respectfully disagree. 

M.P.E.P. § 2136.05 states:

When a prior U.S. patent, U.S. patent application publication, or
international application publication is not a statutory bar, a 35 U.S.C. 102(e)
rejection can be overcome by antedating the filing date ... of the reference by
submitting an affidavit or declaration under 37 CFR 1.131 ... In re Mathews, 408
F.2d 1393, 161 USPQ 276 (CCPA 1969). M.P.E.P. § 2136.05 (emphasis added).

According to the M.P.E.P., "[a]n exception to the rule arises when the reference is a
patent or application published under 35 U.S.C. 122(b) and the reference has claims directed to
the same patentable invention as the application claims being rejected. 37 CFR 1.131(a)(1). The
reason for this exception is that priority is determined in an interference when the claims
interfere." M.P.E.P. § 2305 1. However, the M.P.E.P. continues, stating that:

In determining whether a 37 CFR 1.131 affidavit is permitted or not, the 
examiner should keep the purpose of the exception in mind. If an interference 
would not be possible at the time the affidavit would be submitted, then the 
affidavit should be permitted. This situation could arise two ways. . . . Similarly, 
if a published application contains claims to the same invention, but the claims in 
the published application are not in condition for allowance, then no interference 
is yet possible. 37 CFR 41.102. Since the claims in the published application
might never be allowed in their present form, it is not appropriate to proceed as
though an interference would be inevitable. Consequently, an affidavit under 37
CFR 1.131 may be submitted. M.P.E.P. § 2305 1 (emphasis added).

A non-final Office Action rejecting the claims of the Afar application was mailed on 
December 28, 2007, rejecting the claims under 35 U.S.C. § 102(e) over several references. A 
response to the non-final Office Action was forwarded to the Examiner for consideration on May
19, 2008. In view of the fact that the Afar application is not yet in condition for allowance, 
Applicants submit that it is permissible for Applicants to antedate the Afar reference in the 
instant case.

The nucleic acid and polypeptide sequences of PR0874 (SEQ ID NOs: 9 and 10 of the
instant application) were disclosed by Applicants in US Provisional Application 60/088030 filed 
June 4, 1998 in Figures 1 and 2, a copy of which is attached as Exhibit 1. Applicants also 
disclose antibodies to PR0874 in US Provisional Application 60/088030, including polyclonal, 
monoclonal, and humanized antibodies. Exhibit 1 at pages 22-28. 

The well-established "Stempel Doctrine" stands for the proposition that a patent applicant 
can effectively swear back of and remove a cited prior art reference by showing that he or she 
possessed that portion of the claimed invention that is disclosed in the prior art reference. (In re 
Stempel, 113 U.S.P.Q. 77 (CCPA 1957)). In other words, a patent applicant need not 
demonstrate that he or she made the entire claimed invention in order to remove a cited prior art 
reference. He or she need only demonstrate prior possession of that portion of his or her claimed 
invention that is disclosed in the prior art reference and nothing more. 

The Examiner asserts that Afar discloses a sequence which is at least partially identical to 
SEQ ID NO: 10, as well as antibodies to the sequence. Applicants demonstrate, by means of the 
disclosure of antibodies to PR0874 (SEQ ID NO: 10) in their provisional application filed June 4, 
1998, that they were in possession of as much of the claimed invention as is disclosed in the Afar 
reference prior to its earliest possible effective filing date, April 12, 1999. Thus, Applicants
respectfully submit that Afar is not available as prior art under 102(e). 

In light of the above, Applicants respectfully request that the Examiner reconsider and 
withdraw the 35 U.S.C. §§ 102(e)/103(a) rejection of the pending claims over Afar et al.

CONCLUSION

In view of the above, Applicants respectfully maintain that the claims are patentable and
request that they be passed to issue. Applicants invite the Examiner to call the undersigned if any 
remaining issues may be resolved by telephone. 

Please charge any additional fees, including any fees for additional extension of time, or 
credit overpayment to Deposit Account No. 11-1410.

Respectfully submitted,

KNOBBE, MARTENS, OLSON & BEAR, LLP 



Dated: [illegible]



Brenden S. Gingrich 

Registration No. 60,295 
Attorney of Record 
Customer No. 30,313 
(619) 235-8550

NOVEL MULTI-SPAN TRANSMEMBRANE POLYPEPTIDES
AND NUCLEIC ACIDS ENCODING THE SAME 

FIELD OF THE INVENTION
The present invention relates generally to the identification and isolation of novel DNA
and to the recombinant production of novel multi-span transmembrane polypeptides designated
herein as "PR0874" polypeptides.

BACKGROUND OF THE INVENTION

Membrane-bound proteins and receptors can play an important role in the formation,
differentiation and maintenance of multicellular organisms. The fate of many individual cells,
e.g., proliferation, migration, differentiation, or interaction with other cells, is typically
governed by information received from other cells and/or the immediate environment. This
information is often transmitted by secreted polypeptides (for instance, mitogenic factors,
survival factors, cytotoxic factors, differentiation factors, neuropeptides, and hormones) which
are, in turn, received and interpreted by diverse cell receptors or membrane-bound proteins.
Such membrane-bound proteins and cell receptors include, but are not limited to, cytokine
receptors, receptor kinases, receptor phosphatases, receptors involved in cell-cell interactions,
and cellular adhesin molecules like selectins and integrins. For instance, transduction of
signals that regulate cell growth and differentiation is regulated in part by phosphorylation of
various cellular proteins. Protein tyrosine kinases, enzymes that catalyze that process, can also
act as growth factor receptors. Examples include fibroblast growth factor receptor and nerve
growth factor receptor.

Membrane-bound proteins and receptor molecules have various industrial applications,
including as pharmaceutical and diagnostic agents. Receptor immunoadhesins, for instance,
can be employed as therapeutic agents to block receptor-ligand interaction. The
membrane-bound proteins can also be employed for screening of potential peptide or small
molecule inhibitors of the relevant receptor/ligand interaction.

Efforts are being undertaken by both industry and academia to identify new, native
receptor and transmembrane proteins. Many efforts are focused on the screening of
mammalian recombinant DNA libraries to identify the coding sequences for novel receptor and
transmembrane proteins.

Described herein is the identification and characterization of a novel multi-span
transmembrane polypeptide.

SUMMARY OF THE INVENTION
Applicants have identified a cDNA clone that encodes a novel multi-span
transmembrane polypeptide, which is designated in the present application as "PR0874".

In one embodiment, the invention provides an isolated nucleic acid molecule
comprising DNA encoding a PR0874 polypeptide. In one aspect, the isolated nucleic acid
comprises DNA encoding the PR0874 polypeptide having amino acid residues 1 to 321 of
Figure 2 (SEQ ID NO:3), or is complementary to such encoding nucleic acid sequence, and
remains stably bound to it under at least moderate, and optionally, under high stringency
conditions. In other aspects, the isolated nucleic acid comprises DNA encoding the PR0874
polypeptide having amino acids from about X to 321 of Figure 2 (SEQ ID NO:3), where X is
any amino acid from about 270 to about 279 of Figure 2 (SEQ ID NO:3), or is complementary
to such encoding nucleic acid sequence, and remains stably bound to it under at least moderate,
and optionally, under high stringency conditions. The isolated nucleic acid sequence may
comprise the cDNA insert of the DNA40621-1440 vector deposited on June 2, 1998, as ATCC
______, which includes the nucleotide sequence encoding PR0874.

In another embodiment, the invention provides a vector comprising DNA encoding a
PR0874 polypeptide. A host cell comprising such a vector is also provided. By way of
example, the host cells may be CHO cells, E. coli, or yeast. A process for producing PR0874
polypeptides is further provided and comprises culturing host cells under conditions suitable
for expression of PR0874 and recovering PR0874 from the cell culture.

In another embodiment, the invention provides isolated PR0874 polypeptide. In
particular, the invention provides isolated native sequence PR0874 polypeptide, which in one
embodiment, includes an amino acid sequence comprising residues 1 to 321 of Figure 2 (SEQ
ID NO:3). Additional embodiments of the present invention are directed to PR0874
polypeptides comprising amino acids X to 321 of Figure 2 (SEQ ID NO:3), where X is any
amino acid from about 270 to about 279 of Figure 2 (SEQ ID NO:3). Optionally, the PR0874
polypeptide is obtained or is obtainable by expressing the polypeptide encoded by the cDNA
insert of the DNA40621-1440 vector deposited on June 2, 1998, as ATCC ______.



In another embodiment, the invention provides chimeric molecules comprising a
PR0874 polypeptide or soluble extracellular domain thereof fused to a heterologous
polypeptide or amino acid sequence. An example of such a chimeric molecule comprises a
PR0874 polypeptide fused to an epitope tag sequence or a Fc region of an immunoglobulin.

In another embodiment, the invention provides an antibody which specifically binds to
a PR0874 polypeptide or extracellular domain thereof. Optionally, the antibody is a
monoclonal antibody.

BRIEF DESCRIPTION OF THE DRAWINGS

Figures 1A and 1B show a nucleotide sequence (SEQ ID NO:1) containing the
nucleotide sequence (SEQ ID NO:2) of a native sequence PR0874 cDNA (nucleotides 1-963),
wherein the nucleotide sequence (SEQ ID NO:1) is a clone designated herein as "UNQ441"
and/or "DNA40621-1440". Also presented are the locations (as underlined) of
oligonucleotide probes based upon the DNA40621-1440 sequence, wherein those probes are
designated herein as "40621.tm.f1", "40621.tm.p1" and "40621.tm.r1". The deduced amino
acid sequence encoded by the DNA40621-1440 molecule is shown below the nucleotide
sequence.

Figure 2 shows the amino acid sequence (SEQ ID NO:3) derived from the nucleotide 
sequence shown in Figure 1. Also shown are the approximate locations of various other 
protein domains. 

Figures 3A to 3H show an alignment of nucleotide sequences from a variety of
expressed sequence tags as well as a consensus nucleotide sequence derived therefrom
designated "consen01", and also referred to herein as DNA36459.

Figures 4A to 4B show a double-stranded representation of the DNA36459 sequence
(SEQ ID NO:4) shown in Figures 3A to 3H as well as the locations (as underlined) of
oligonucleotide primers based upon the DNA36459 consensus sequence, wherein the
oligonucleotide primers are herein designated "36459.f1", "36459.r1" and "36459.p1". The
deduced amino acid sequence encoded by the DNA36459 sequence is shown below the
nucleotide sequence.



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
I. Definitions

The terms "PR0874 polypeptide" and "PR0874" when used herein encompass native
sequence PR0874 and PR0874 polypeptide variants (which are further defined herein). The
PR0874 polypeptides may be isolated from a variety of sources, such as from human tissue
types or from another source, or prepared by recombinant or synthetic methods.

A "native sequence PR0874 polypeptide" comprises a polypeptide having the same
amino acid sequence as a PR0874 polypeptide derived from nature. Such native sequence
PR0874 polypeptide can be isolated from nature or can be produced by recombinant or
synthetic means. The term "native sequence PR0874 polypeptide" specifically encompasses
naturally-occurring truncated or secreted forms of a PR0874 polypeptide (e.g., soluble forms
containing, for instance, an extracellular domain sequence), naturally-occurring variant forms
(e.g., alternatively spliced forms) and naturally-occurring allelic variants of a PR0874
polypeptide. In one embodiment of the invention, the native sequence PR0874 polypeptide
comprises amino acids 1 to 321 of Figure 2 (SEQ ID NO:3). Optionally, the PR0874
polypeptide is obtained or obtainable by expressing the polypeptide encoded by the cDNA
insert of the vector DNA40621-1440 deposited on June 2, 1998, as ATCC ______.

The "PR0874 extracellular domain" or "PR0874 ECD" refers to a form of the
PR0874 polypeptide which is essentially free of the transmembrane and cytoplasmic domains
of the PR0874 polypeptide. Optionally the PR0874 ECD will comprise amino acids 275 to
321 of Figure 2 (SEQ ID NO:3). Ordinarily, PR0874 ECD will have less than 1% of such
transmembrane and/or cytoplasmic domains and preferably, will have less than 0.5% of such
domains. It will be understood that any transmembrane domain identified for the PR0874
polypeptide of the present invention is identified pursuant to criteria routinely employed in the
art for identifying that type of hydrophobic domain. The exact boundaries of a transmembrane
domain may vary but most likely by no more than about 5 amino acids at either end of the
domain as initially identified. Accordingly, the PR0874 polypeptide ECD may optionally
comprise amino acids X to 321 of Figure 2 (SEQ ID NO:3), wherein X is any one of amino
acid residues 270 to 279 of Figure 2 (SEQ ID NO:3).

"PR0874 variant" means an active PR0874 polypeptide as defined below having at
least about 80% amino acid sequence identity with the PR0874 polypeptide having the
deduced amino acid sequence shown in Figure 2 (SEQ ID NO:3). Such PR0874 polypeptide
variants include, for instance, PR0874 polypeptides wherein one or more amino acid residues
are added, or deleted, at the N- or C-terminus of the sequence of Figure 2 (SEQ ID NO:3).
Ordinarily, a PR0874 polypeptide variant will have at least about 80% amino acid sequence
identity, preferably at least about 85% amino acid sequence identity, more preferably at least
about 90% amino acid sequence identity and even more preferably at least about 95% amino
acid sequence identity with the amino acid sequence of Figure 2 (SEQ ID NO:3). The term
variant does not encompass the known peptides which have identity with the PR0874
described herein.

"Percent (%) amino acid sequence identity" or "percent (%) nucleic acid sequence
identity" when referring to the PR0874 amino acid sequences or DNA40621-1440 nucleic acid
sequences identified herein, respectively, is defined as the percentage of amino acid residues or
nucleic acid residues in a candidate sequence that are identical with the amino acid residues
in a PR0874 polypeptide sequence, or the nucleic acid residues in a DNA40621-1440 nucleic
acid sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the
maximum percent sequence identity, and not considering any conservative substitutions as part
of the sequence identity. The % identity values used herein were generated using the
WU-BLAST-2 computer program (Altschul et al., Methods in Enzymology 266:460-480 (1996);
http://blast.wustl/edu/blast/README.html). Most of the WU-BLAST-2 search parameters
were set to the default values. The adjustable parameters were set with the following values:
overlap span = 1, overlap fraction = 0.125, word threshold (T) = 11, and scoring matrix =
BLOSUM62. The HSP S and HSP S2 parameters, which are dynamic values used by
BLAST-2, are established by the program itself depending upon the composition of the sequence
of interest and composition of the database against which the sequence is being searched.
However, the values may be adjusted to increase sensitivity. A % sequence identity value is
determined by the fraction of matching identical residues divided by the total number of
residues of the "longer" sequence. The "longer" sequence is the one having the most actual
residues in the aligned region (gaps introduced by WU-BLAST-2 to maximize the alignment
score are ignored).
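The final step of the calculation defined above (identical residues divided by the residue count of the "longer" sequence, with gaps ignored) can be sketched as follows. This is an illustrative sketch only: it assumes the two sequences have already been aligned, with "-" marking gaps, and it does not reproduce WU-BLAST-2 or its search parameters. The function name and the example sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over the 'longer' sequence of a pre-computed alignment.

    Both inputs must be alignment rows of equal length. Gap characters are
    not counted as residues and never count as matches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")

    # Count columns where both rows carry the same actual residue.
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )

    # The "longer" sequence has the most actual residues in the aligned region.
    residues_a = len(aligned_a) - aligned_a.count(gap)
    residues_b = len(aligned_b) - aligned_b.count(gap)
    longer = max(residues_a, residues_b)
    if longer == 0:
        raise ValueError("alignment contains no residues")

    return 100.0 * matches / longer


# Hypothetical alignment: five identical residues, six residues in the longer row.
print(percent_identity("MKT-LV", "MKTALV"))
```

Dividing by the longer sequence means that a gap in the shorter row lowers the reported identity rather than being ignored.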

Percent amino acid or nucleic acid sequence identity can also be determined in other
ways that are within the skill in the art, for instance, using other publicly available computer
software such as ALIGN or Megalign (DNASTAR) software. Those skilled in the art can
determine appropriate parameters for measuring alignment, including any algorithms needed
to achieve maximal alignment over the full length of the sequences being compared.

The term "epitope tagged" where used herein refers to a chimeric polypeptide
comprising a PR0874 polypeptide, or domain sequence thereof, fused to a "tag polypeptide".
The tag polypeptide has enough residues to provide an epitope against which an antibody may
be made, or which can be identified by some other agent, yet is short enough such that it does
not interfere with the activity of the PR0874 polypeptide. The tag polypeptide preferably is
also fairly unique so that the antibody does not substantially cross-react with other epitopes.
Suitable tag polypeptides generally have at least six amino acid residues and usually between
about 8 to about 50 amino acid residues (preferably, between about 10 to about 20 residues).

"Isolated," when used to describe the various polypeptides disclosed herein, means
polypeptide that has been identified and separated and/or recovered from a component of its
natural environment. Contaminant components of its natural environment are materials that
would typically interfere with diagnostic or therapeutic uses for the polypeptide, and may
include enzymes, hormones, and other proteinaceous or non-proteinaceous solutes. In
preferred embodiments, the polypeptide will be purified (1) to a degree sufficient to obtain at
least 15 residues of N-terminal or internal amino acid sequence by use of a spinning cup
sequenator, or (2) to homogeneity by SDS-PAGE under non-reducing or reducing conditions
using Coomassie blue or, preferably, silver stain. Isolated polypeptide includes polypeptide
in situ within recombinant cells, since at least one component of the PR0874 polypeptide
natural environment will not be present. Ordinarily, however, isolated polypeptide will be
prepared by at least one purification step.

An "isolated" PR0874 polypeptide-encoding nucleic acid molecule is a nucleic acid
molecule that is identified and separated from at least one contaminant nucleic acid molecule
with which it is ordinarily associated in the natural source of the PR0874 polypeptide-encoding
nucleic acid. An isolated PR0874 polypeptide-encoding nucleic acid molecule is
other than in the form or setting in which it is found in nature. Isolated PR0874
polypeptide-encoding nucleic acid molecules therefore are distinguished from the PR0874
polypeptide-encoding nucleic acid molecule as it exists in natural cells. However, an isolated
PR0874 polypeptide-encoding nucleic acid molecule includes PR0874 polypeptide-encoding nucleic
acid molecules contained in cells that ordinarily express PR0874 polypeptide where, for
example, the nucleic acid molecule is in a chromosomal location different from that of natural
cells.

The term "control sequences" refers to DNA sequences necessary for the expression
of an operably linked coding sequence in a particular host organism. The control sequences
that are suitable for prokaryotes, for example, include a promoter, optionally an operator
sequence, and a ribosome binding site. Eukaryotic cells are known to utilize promoters,
polyadenylation signals, and enhancers.

Nucleic acid is "operably linked" when it is placed into a functional relationship with
another nucleic acid sequence. For example, DNA for a presequence or secretory leader is
operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in
the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding
sequence if it affects the transcription of the sequence; or a ribosome binding site is operably
linked to a coding sequence if it is positioned so as to facilitate translation. Generally,
"operably linked" means that the DNA sequences being linked are contiguous, and, in the case
of a secretory leader, contiguous and in reading phase. However, enhancers do not have to
be contiguous. Linking is accomplished by ligation at convenient restriction sites. If such
sites do not exist, the synthetic oligonucleotide adaptors or linkers are used in accordance with
conventional practice.

The term "antibody" is used in the broadest sense and specifically covers single
anti-PR0874 polypeptide monoclonal antibodies (including agonist, antagonist, and neutralizing
antibodies) and anti-PR0874 antibody compositions with polyepitopic specificity. The term
"monoclonal antibody" as used herein refers to an antibody obtained from a population of
substantially homogeneous antibodies, i.e., the individual antibodies comprising the population
are identical except for possible naturally-occurring mutations that may be present in minor
amounts.

"Active" or "activity" for the purposes herein refers to form(s) of PR0874 which 
retain the biologic and/or immunologic activities of native or naturally-occurring PR0874 
polypeptide. 

The terms "treating", "treatment" and "therapy" as used herein refer to curative
therapy, prophylactic therapy, and preventative therapy.

The term "mammal" as used herein refers to any mammal classified as a mammal,
including humans, cows, horses, dogs and cats. In a preferred embodiment of the invention,
the mammal is a human. 



II. Compositions and Methods of the Invention

A. Native PR0874 Polypeptide
The present invention provides newly identified and isolated nucleotide sequences
encoding polypeptides referred to in the present application as PR0874. In particular,
Applicants have identified and isolated cDNA encoding a PR0874 polypeptide, as disclosed
in further detail in the Examples below. The PR0874-encoding clone was isolated from a
human fetal lung library. To Applicants' present knowledge, the UNQ441 (DNA40621-1440)
nucleotide sequence encodes a novel factor, although, as described below in Example 1,
some sequence identity with known proteins was revealed using BLAST and FastA sequence
alignment computer programs.

B. PR0874 Variants

In addition to the native sequence PR0874 polypeptide described herein, it is
contemplated that PR0874 variants can be prepared. PR0874 variants can be prepared by
introducing appropriate nucleotide changes into the PR0874-encoding DNA, or by synthesis
of the desired PR0874 polypeptide. Those skilled in the art will appreciate that amino acid
changes may alter post-translational processes of the PR0874 polypeptide, such as changing
the number or position of glycosylation sites or altering the membrane anchoring
characteristics.

Variations in the native sequence of the PR0874 polypeptide or in various domains of
the PR0874 polypeptide described herein can be made, for example, using any of the
techniques and guidelines for conservative and non-conservative mutations set forth, for
instance, in U.S. Patent No. 5,364,934. Variations may be a substitution, deletion or insertion
of one or more codons encoding the PR0874 polypeptide that results in a change in the amino
acid sequence of the PR0874 polypeptide as compared with the native sequence PR0874.
Optionally the variation is by substitution of at least one amino acid with any other amino acid
in one or more of the domains of the PR0874 polypeptide. Guidance in determining which
amino acid residue may be inserted, substituted or deleted without adversely affecting the
desired activity may be found by comparing the sequence of the PR0874 polypeptide with that
of homologous known protein molecules and minimizing the number of amino acid sequence
changes made in regions of high homology. Amino acid substitutions can be the result of
replacing one amino acid with another amino acid having similar structural and/or chemical
properties, such as the replacement of a leucine with a serine, i.e., conservative amino acid
replacements. Insertions or deletions may optionally be in the range of 1 to 5 amino acids.
The variation allowed may be determined by systematically making insertions, deletions or
substitutions of amino acids in the sequence and testing the resulting variants for activity in
any of the in vitro assays described in the Examples below.

The variations can be made using methods known in the art such as oligonucleotide-
mediated (site-directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed
mutagenesis [Carter et al., Nucl. Acids Res., 15:4331 (1986); Zoller et al., Nucl. Acids Res.,
10:6487 (1987)], cassette mutagenesis [Wells et al., Gene, 34:315 (1985)], restriction selection
mutagenesis [Wells et al., Philos. Trans. R. Soc. London SerA, 217:415 (1986)] or other
known techniques can be performed on the cloned DNA to produce the PR0874-encoding
variant DNA.

Scanning amino acid analysis can also be employed to identify one or more amino acids
along a contiguous sequence. Among the preferred scanning amino acids are relatively small,
neutral amino acids. Such amino acids include alanine, glycine, serine, and cysteine. Alanine
is typically a preferred scanning amino acid among this group because it eliminates the
side-chain beyond the beta-carbon and is less likely to alter the main-chain conformation of the
variant. Alanine is also typically preferred because it is the most common amino acid.
Further, it is frequently found in both buried and exposed positions [Creighton, The Proteins,
(W.H. Freeman & Co., N.Y.); Chothia, J. Mol. Biol., 150:1 (1976)]. If alanine substitution
does not yield adequate amounts of variant, an isoteric amino acid can be used.
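
By way of illustration only, the enumeration of single-residue alanine variants along a contiguous sequence might be sketched as follows. The function name `alanine_scan`, its parameters, and the short example peptide are hypothetical and are not taken from the present disclosure; the sketch only lists candidate variants and says nothing about their activity.

```python
from typing import Dict, Optional


def alanine_scan(sequence: str, start: int = 0, end: Optional[int] = None) -> Dict[str, str]:
    """Return single-residue alanine variants of `sequence`.

    Keys are mutation names in the usual form (original residue, 1-based
    position, new residue), e.g. "L3A". Positions already occupied by
    alanine are skipped. `start` and `end` are 0-based slice bounds
    (hypothetical parameters chosen for this sketch).
    """
    sequence = sequence.upper()
    end = len(sequence) if end is None else end
    variants: Dict[str, str] = {}
    for i in range(start, end):
        residue = sequence[i]
        if residue == "A":
            continue  # nothing to substitute at this position
        name = f"{residue}{i + 1}A"
        variants[name] = sequence[:i] + "A" + sequence[i + 1:]
    return variants


if __name__ == "__main__":
    # Illustrative peptide only; not a PR0874 sequence.
    for name, variant in alanine_scan("MALK").items():
        print(name, variant)
```

Each variant so enumerated would then be prepared and tested for activity as described above.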

C. Modifications of PR0874

Covalent modifications of PR0874 polypeptides are included within the scope of this
invention. One type of covalent modification includes reacting targeted amino acid residues
of a PR0874 polypeptide with an organic derivatizing agent that is capable of reacting with
selected side chains or the N- or C-terminal residues of a PR0874 polypeptide.
Derivatization with bifunctional agents is useful, for instance, for crosslinking PR0874 to a
water-insoluble support matrix or surface for use in the method for purifying anti-PR0874
antibodies, and vice-versa. Commonly used crosslinking agents include, e.g.,
1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, for example,
esters with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl esters
such as 3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as
bis-N-maleimido-1,8-octane and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate.

Other modifications include deamidation of glutaminyl and asparaginyl residues to the
corresponding glutamyl and aspartyl residues, respectively, hydroxylation of proline and
lysine, phosphorylation of hydroxyl groups of seryl or threonyl residues, methylation of the
α-amino groups of lysine, arginine, and histidine side chains [T.E. Creighton, Proteins:
Structure and Molecular Properties, W.H. Freeman & Co., San Francisco, pp. 79-86 (1983)],
acetylation of the N-terminal amine, and amidation of any C-terminal carboxyl group.

Another type of covalent modification of the PR0874 polypeptide included within the
scope of this invention comprises altering the native glycosylation pattern of the polypeptide.
"Altering the native glycosylation pattern" is intended for purposes herein to mean deleting
one or more carbohydrate moieties found in native sequence PR0874 polypeptide, and/or
adding one or more glycosylation sites that are not present in the native sequence PR0874
polypeptide.

Addition of glycosylation sites to PR0874 polypeptides may be accomplished by
altering the amino acid sequence thereof. The alteration may be made, for example, by the
addition of, or substitution by, one or more serine or threonine residues to the native sequence
PR0874 polypeptide (for O-linked glycosylation sites). The PR0874 amino acid sequence
may optionally be altered through changes at the DNA level, particularly by mutating the DNA
encoding the PR0874 polypeptide at preselected bases such that codons are generated that will
translate into the desired amino acids.
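
As a minimal sketch of such a change at the DNA level, the following replaces one codon of a coding sequence with a codon for serine or threonine. The function name `substitute_codon`, the codon choices in `CODON_FOR`, and the nine-base example sequence are assumptions made for illustration; any codon encoding the desired amino acid could be used, and the example is not a PR0874 sequence.

```python
# Illustrative codon choices (assumption): TCT encodes serine, ACT encodes threonine.
CODON_FOR = {"S": "TCT", "T": "ACT"}


def substitute_codon(coding_dna: str, residue_number: int, new_aa: str) -> str:
    """Replace the codon for the 1-based `residue_number` with a codon for `new_aa`.

    `coding_dna` is assumed to be an in-frame coding sequence whose first
    base is the first base of codon 1.
    """
    coding_dna = coding_dna.upper()
    if len(coding_dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    if new_aa not in CODON_FOR:
        raise ValueError("this sketch only introduces serine (S) or threonine (T)")
    start = (residue_number - 1) * 3
    if start < 0 or start + 3 > len(coding_dna):
        raise ValueError("residue_number is outside the coding sequence")
    return coding_dna[:start] + CODON_FOR[new_aa] + coding_dna[start + 3:]


if __name__ == "__main__":
    # ATG GCT CTG (Met-Ala-Leu) with residue 2 changed to serine.
    print(substitute_codon("ATGGCTCTG", 2, "S"))
```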

Another means of increasing the number of carbohydrate moieties on the PR0874
polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. Such
methods are described in the art, e.g., in WO 87/05330 published 11 September 1987, and in
Aplin and Wriston, CRC Crit. Rev. Biochem., pp. 259-306 (1981).

Removal of carbohydrate moieties present on the PR0874 polypeptide may be
accomplished chemically or enzymatically or by mutational substitution of codons encoding
for amino acid residues that serve as targets for glycosylation. Chemical deglycosylation
techniques are known in the art and described, for instance, by Hakimuddin, et al., Arch.
Biochem. Biophys., 259:52 (1987) and by Edge et al., Anal. Biochem., 118:131 (1981).
Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of
a variety of endo- and exo-glycosidases as described by Thotakura et al., Meth. Enzymol.,
138:350 (1987).

Another type of covalent modification of PR0874 comprises linking the PR0874
polypeptide to one of a variety of nonproteinaceous polymers, e.g., polyethylene glycol,
polypropylene glycol, or polyoxyalkylenes, in the manner set forth in U.S. Patent Nos.
4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337.

PR0874 polypeptides of the present invention may also be modified in a way to form
chimeric molecules comprising a PR0874 polypeptide fused to another, heterologous
polypeptide or amino acid sequence. In one embodiment, such a chimeric molecule comprises
a fusion of a PR0874 polypeptide with a tag polypeptide which provides an epitope to which
an anti-tag antibody can selectively bind. The epitope tag is generally placed at the amino- or
carboxyl-terminus of the PR0874 polypeptide. The presence of such epitope-tagged forms
of a PR0874 polypeptide can be detected using an antibody against the tag polypeptide. Also,
provision of the epitope tag enables the PR0874 polypeptide to be readily purified by affinity
purification using an anti-tag antibody or another type of affinity matrix that binds to the
epitope tag. In an alternative embodiment, the chimeric molecule may comprise a fusion of
a PR0874 polypeptide with an immunoglobulin or a particular region of an immunoglobulin.
For a bivalent form of the chimeric molecule, such a fusion could be to the Fc region of an
IgG molecule.

Various tag polypeptides and their respective antibodies are well known in the art.
Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu
HA tag polypeptide and its antibody 12CA5 [Field et al., Mol. Cell. Biol., 3:2159-2165
(1988)]; the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies thereto [Evan
et al., Molecular and Cellular Biology, 5:3610-3616 (1985)]; and the Herpes Simplex virus
glycoprotein D (gD) tag and its antibody [Paborsky et al., Protein Engineering, 2(6):547-553
(1990)]. Other tag polypeptides include the Flag-peptide [Hopp et al., BioTechnology,
6:1204-1210 (1988)]; the KT3 epitope peptide [Martin et al., Science, 255:192-194 (1992)];
an α-tubulin epitope peptide [Skinner et al., J. Biol. Chem., 266:15163-15166 (1991)]; and
the T7 gene 10 protein peptide tag [Lutz-Freyermuth et al., Proc. Natl. Acad. Sci. USA,
87:6393-6397 (1990)].

The PR0874 polypeptide of the present invention may also be modified in a way to
form a chimeric molecule comprising a PR0874 polypeptide fused to a leucine zipper.
Various leucine zipper polypeptides have been described in the art. See, e.g., Landschulz et
al., Science 240:1759 (1988); WO 94/10308; Hoppe et al., FEBS Letters 344:1991 (1994);
Maniatis et al., Nature 341:24 (1989). It is believed that use of a leucine zipper fused to a
PR0874 polypeptide may be desirable to assist in dimerizing or trimerizing soluble PR0874
polypeptide in solution. Those skilled in the art will appreciate that the leucine zipper may be
fused at either the N- or C-terminal end of the PR0874 molecule.

D. Preparation of PR0874
The description below relates primarily to production of PR0874 by culturing cells
transformed or transfected with a vector containing PR0874 polypeptide encoding nucleic
acid. It is, of course, contemplated that alternative methods, which are well known in the art,
may be employed to prepare PR0874 polypeptides. For instance, the PR0874 sequence, or
portions thereof, may be produced by direct peptide synthesis using solid-phase techniques
[see, e.g., Stewart et al., Solid-Phase Peptide Synthesis, W.H. Freeman Co., San Francisco,
CA (1969); Merrifield, J. Am. Chem. Soc., 55:2149-2154 (1963)]. In vitro protein synthesis
may be performed using manual techniques or by automation. Automated synthesis may be
accomplished, for instance, using an Applied Biosystems Peptide Synthesizer (Foster City,
CA) using manufacturer's instructions. Various portions of PR0874 polypeptides may be
chemically synthesized separately and combined using chemical or enzymatic methods to
produce a PR0874 polypeptide.

1. Isolation of DNA Encoding PR0874
DNA encoding a PR0874 polypeptide may be obtained from a cDNA library prepared
from tissue believed to possess the PR0874 mRNA and to express it at a detectable level.
Accordingly, human PR0874-encoding DNA can be conveniently obtained from a cDNA
library prepared from human tissue, such as described in the Examples. The PR0874-
encoding gene may also be obtained from a genomic library or by oligonucleotide synthesis.
Libraries can be screened with probes (such as antibodies to a PR0874 polypeptide or
oligonucleotides of at least about 20-80 bases) designed to identify the gene of interest or the
protein encoded by it. Screening the cDNA or genomic library with the selected probe may
be conducted using standard procedures, such as described in Sambrook et al., Molecular
Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989).
An alternative means to isolate the gene encoding PR0874 is to use PCR methodology
[Sambrook et al., supra; Dieffenbach et al., PCR Primer: A Laboratory Manual (Cold Spring
Harbor Laboratory Press, 1995)].

The Examples below describe techniques for screening a cDNA library. The
oligonucleotide sequences selected as probes should be of sufficient length and sufficiently
unambiguous that false positives are minimized. The oligonucleotide is preferably labeled
such that it can be detected upon hybridization to DNA in the library being screened. Methods
of labeling are well known in the art, and include the use of radiolabels like 32P-labeled ATP,
biotinylation or enzyme labeling. Hybridization conditions, including moderate stringency and
high stringency, are provided in Sambrook et al., supra.

Sequences identified in such library screening methods can be compared and aligned
to other known sequences deposited and available in public databases such as GenBank or
other private sequence databases. Sequence identity (at either the amino acid or nucleotide
level) within defined regions of the molecule or across the full-length sequence can be
determined through sequence alignment using computer software programs such as ALIGN,
DNAstar, and INHERIT which employ various algorithms to measure homology.
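
For illustration, once two sequences have been aligned (by whatever program), percent identity over the aligned region can be computed as sketched below. This is a simplified calculation under stated assumptions and is not the algorithm used by ALIGN, DNAstar, or INHERIT; the function name `percent_identity`, the gap character, and the convention of counting every column in which at least one sequence has a residue are choices made for this sketch.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Columns where both sequences have a gap are ignored; a column where
    only one sequence has a gap counts as a mismatch (an assumption of
    this sketch, since programs differ in how they treat gaps).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap and y == gap:
            continue  # column carries no information
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Illustrative aligned fragments only; not PR0874 sequences.
    print(percent_identity("MKT-LLV", "MKTALIV"))
```

The same function applies at either the amino acid or nucleotide level, and may be applied to a defined region by slicing both aligned strings before the call.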

Nucleic acid having protein coding sequence may be obtained by screening selected
cDNA or genomic libraries using the deduced amino acid sequence disclosed herein for the
first time, and, if necessary, using conventional primer extension procedures as described in
Sambrook et al., supra, to detect precursors and processing intermediates of mRNA that may
not have been reverse-transcribed into cDNA.

2. Selection and Transformation of Host Cells 
Host cells are transfected or transformed with expression or cloning vectors described
herein for PR0874 polypeptide production and cultured in conventional nutrient media
modified as appropriate for inducing promoters, selecting transformants, or amplifying the
genes encoding the desired sequences. The culture conditions, such as media, temperature,
pH and the like, can be selected by the skilled artisan without undue experimentation. In
general, principles, protocols, and practical techniques for maximizing the productivity of cell
cultures can be found in Mammalian Cell Biotechnology: a Practical Approach, M. Butler,
ed. (IRL Press, 1991) and Sambrook et al., supra.

Methods of transfection are known to the ordinarily skilled artisan, for example, CaPO4
and electroporation. Depending on the host cell used, transformation is performed using
standard techniques appropriate to such cells. The calcium treatment employing calcium
chloride, as described in Sambrook et al., supra, or electroporation is generally used for
prokaryotes or other cells that contain substantial cell-wall barriers. Infection with
Agrobacterium tumefaciens is used for transformation of certain plant cells, as described by
Shaw et al., Gene, 21:315 (1983) and WO 89/05859 published June 29, 1989. For
mammalian cells without such cell walls, the calcium phosphate precipitation method of
Graham and van der Eb, Virology, 52:456-457 (1978) can be employed. General aspects of
mammalian cell host system transformations have been described in U.S. Patent No.
4,399,216. Transformations into yeast are typically carried out according to the method of
Van Solingen et al., J. Bact., 130:946 (1977) and Hsiao et al., Proc. Natl. Acad. Sci. (USA),
76:3829 (1979). However, other methods for introducing DNA into cells, such as by nuclear
microinjection, electroporation, bacterial protoplast fusion with intact cells, or polycations,
e.g., polybrene, polyornithine, may also be used. For various techniques for transforming
mammalian cells, see Keown et al., Methods in Enzymology, 185:527-537 (1990) and
Mansour et al., Nature, 236:348-352 (1988).

Suitable host cells for cloning or expressing the DNA in the vectors herein include
prokaryote, yeast, or higher eukaryote cells. Suitable prokaryotes include but are not limited
to eubacteria, such as Gram-negative or Gram-positive organisms, for example,
Enterobacteriaceae such as E. coli. Various E. coli strains are publicly available, such as E.
coli K12 strain MM294 (ATCC 31,446); E. coli X1776 (ATCC 31,537); E. coli strain W3110
(ATCC 27,325) and K5 772 (ATCC 53,635).

In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are
suitable cloning or expression hosts for PR0874-encoding vectors. Saccharomyces cerevisiae
is a commonly used lower eukaryotic host microorganism.

Suitable host cells for the expression of glycosylated PR0874 are derived from
multicellular organisms. Examples of invertebrate cells include insect cells such as Drosophila
S2 and Spodoptera Sf9, as well as plant cells. Examples of useful mammalian host cell lines
include Chinese hamster ovary (CHO) and COS cells. More specific examples include
monkey kidney CV1 line transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic
kidney line (293 or 293 cells subcloned for growth in suspension culture, Graham et al., J.
Gen Virol., 36:59 (1977)); Chinese hamster ovary cells/-DHFR (CHO, Urlaub and Chasin,
Proc. Natl. Acad. Sci. USA, 22:4216 (1980)); mouse Sertoli cells (TM4, Mather, Biol.
Reprod., 23:243-251 (1980)); human lung cells (W138, ATCC CCL 75); human liver cells
(Hep G2, HB 8065); and mouse mammary tumor (MMT 060562, ATCC CCL51). The
selection of the appropriate host cell is deemed to be within the skill in the art.

3. Selection and Use of a Replicable Vector
The nucleic acid (e.g., cDNA or genomic DNA) encoding the desired PR0874
polypeptide may be inserted into a replicable vector for cloning (amplification of the DNA)
or for expression. Various vectors are publicly available. The vector may, for example, be
in the form of a plasmid, cosmid, viral particle, or phage. The appropriate nucleic acid
sequence may be inserted into the vector by a variety of procedures. In general, DNA is
inserted into an appropriate restriction endonuclease site(s) using techniques known in the art.
Vector components generally include, but are not limited to, one or more of a signal sequence,
an origin of replication, one or more marker genes, an enhancer element, a promoter, and a
transcription termination sequence. Construction of suitable vectors containing one or more
of these components employs standard ligation techniques which are known to the skilled
artisan.

The desired PR0874 polypeptide may be produced recombinantly not only directly,
but also as a fusion polypeptide with a heterologous polypeptide, which may be a signal
sequence or other polypeptide having a specific cleavage site at the N-terminus of the mature
protein or polypeptide. In general, the signal sequence may be a component of the vector, or
it may be a part of the PR0874-encoding DNA that is inserted into the vector. The signal
sequence may be a prokaryotic signal sequence selected, for example, from the group of the
alkaline phosphatase, penicillinase, lpp, or heat-stable enterotoxin II leaders. For yeast
secretion the signal sequence may be, e.g., the yeast invertase leader, alpha factor leader
(including Saccharomyces and Kluyveromyces α-factor leaders, the latter described in U.S.
Patent No. 5,010,182), or acid phosphatase leader, the C. albicans glucoamylase leader (EP
362,179 published 4 April 1990), or the signal described in WO 90/13646 published 15
November 1990. In mammalian cell expression, mammalian signal sequences may be used
to direct secretion of the protein, such as signal sequences from secreted polypeptides of the
same or related species, as well as viral secretory leaders.

Both expression and cloning vectors contain a nucleic acid sequence that enables the
vector to replicate in one or more selected host cells. Such sequences are well known for a
variety of bacteria, yeast, and viruses. The origin of replication from the plasmid pBR322 is
suitable for most Gram-negative bacteria, the 2μ plasmid origin is suitable for yeast, and
various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful for cloning vectors
in mammalian cells.

Expression and cloning vectors will typically contain a selection gene, also termed a
selectable marker. Typical selection genes encode proteins that (a) confer resistance to
antibiotics or other toxins, e.g., ampicillin, neomycin, methotrexate, or tetracycline, (b)
complement auxotrophic deficiencies, or (c) supply critical nutrients not available from
complex media, e.g., the gene encoding D-alanine racemase for Bacilli.

An example of suitable selectable markers for mammalian cells are those that enable
the identification of cells competent to take up the PR0874-encoding nucleic acid, such as
DHFR or thymidine kinase. An appropriate host cell when wild-type DHFR is employed is
the CHO cell line deficient in DHFR activity, prepared and propagated as described by Urlaub
et al., Proc. Natl. Acad. Sci. USA, 22:4216 (1980). A suitable selection gene for use in yeast
is the trp1 gene present in the yeast plasmid YRp7 [Stinchcomb et al., Nature, 232:39 (1979);
Kingsman et al., Gene, 1:141 (1979); Tschemper et al., Gene, 10:157 (1980)]. The trp1 gene
provides a selection marker for a mutant strain of yeast lacking the ability to grow in
tryptophan, for example, ATCC No. 44076 or PEP4-1 [Jones, Genetics, 55:12 (1977)].

Expression and cloning vectors usually contain a promoter operably linked to the
PR0874-encoding nucleic acid sequence to direct mRNA synthesis. Promoters recognized by
a variety of potential host cells are well known. Promoters suitable for use with prokaryotic
hosts include the β-lactamase and lactose promoter systems [Chang et al., Nature, 275:615
(1978); Goeddel et al., Nature, 281:544 (1979)], alkaline phosphatase, a tryptophan (trp)
promoter system [Goeddel, Nucleic Acids Res., 8:4057 (1980); EP 36,776], and hybrid
promoters such as the tac promoter [deBoer et al., Proc. Natl. Acad. Sci. USA, 80:21-25
(1983)]. Promoters for use in bacterial systems also will contain a Shine-Dalgarno (S.D.)
sequence operably linked to the DNA encoding the PR0874 polypeptide.

Examples of suitable promoting sequences for use with yeast hosts include the
promoters for 3-phosphoglycerate kinase [Hitzeman et al., J. Biol. Chem., 255:2073 (1980)]
or other glycolytic enzymes [Hess et al., J. Adv. Enzyme Reg., 2:149 (1968); Holland,
Biochemistry, 12:4900 (1978)], such as enolase, glyceraldehyde-3-phosphate dehydrogenase,
hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase,
3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose
isomerase, and glucokinase.

Other yeast promoters, which are inducible promoters having the additional advantage 
of transcription controlled by growth conditions, are the promoter regions for alcohol 
dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with 
nitrogen metabolism, metallothionein, glyceraldehyde-3-phosphate dehydrogenase, and
enzymes responsible for maltose and galactose utilization. Suitable vectors and promoters for 
use in yeast expression are further described in EP 73,657. 

PR0874 transcription from vectors in mammalian host cells is controlled, for example,
by promoters obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK
2,211,504 published 5 July 1989), adenovirus (such as Adenovirus 2), bovine papilloma virus,
avian sarcoma virus, cytomegalovirus, a retrovirus, hepatitis-B virus and Simian Virus 40
(SV40), from heterologous mammalian promoters, e.g., the actin promoter or an
immunoglobulin promoter, and from heat-shock promoters, provided such promoters are
compatible with the host cell systems.

Transcription of a DNA encoding a PR0874 polypeptide by higher eukaryotes may be
increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements
of DNA, usually about from 10 to 300 bp, that act on a promoter to increase its transcription.
Many enhancer sequences are now known from mammalian genes (globin, elastase, albumin,
α-fetoprotein, and insulin). Typically, however, one will use an enhancer from a eukaryotic
cell virus. Examples include the SV40 enhancer on the late side of the replication origin (bp
100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side
of the replication origin, and adenovirus enhancers. The enhancer may be spliced into the
vector at a position 5' or 3' to the PR0874 coding sequence, but is preferably located at a site
5' from the promoter.

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal,
human, or nucleated cells from other multicellular organisms) will also contain sequences
necessary for the termination of transcription and for stabilizing the mRNA. Such sequences
are commonly available from the 5' and, occasionally 3', untranslated regions of eukaryotic
or viral DNAs or cDNAs. These regions contain nucleotide segments transcribed as
polyadenylated fragments in the untranslated portion of the mRNA encoding PR0874.

Still other methods, vectors, and host cells suitable for adaptation to the synthesis of
PR0874 polypeptides in recombinant vertebrate cell culture are described in Gething et al.,
Nature, 221:620-625 (1981); Mantei et al., Nature, 281:40-46 (1979); EP 117,060; and EP
117,058.



4. Detecting Gene Amplification/Expression
Gene amplification and/or expression may be measured in a sample directly, for
example, by conventional Southern blotting, Northern blotting to quantitate the transcription
of mRNA [Thomas, Proc. Natl. Acad. Sci. USA, 22:5201-5205 (1980)], dot blotting (DNA
analysis), or in situ hybridization, using an appropriately labeled probe, based on the
sequences provided herein. Alternatively, antibodies may be employed that can recognize
specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes
or DNA-protein duplexes. The antibodies in turn may be labeled and the assay may be carried
out where the duplex is bound to a surface, so that upon the formation of duplex on the
surface, the presence of antibody bound to the duplex can be detected.

Gene expression, alternatively, may be measured by immunological methods, such as
immunohistochemical staining of cells or tissue sections and assay of cell culture or body
fluids, to quantitate directly the expression of gene product. Antibodies useful for
immunohistochemical staining and/or assay of sample fluids may be either monoclonal or
polyclonal, and may be prepared in any mammal. Conveniently, the antibodies may be
prepared against a native sequence PR0874 polypeptide or against a synthetic peptide based
on the DNA sequences provided herein or against exogenous sequence fused to PR0874-
encoding DNA and encoding a specific antibody epitope.



5. Purification of Polypeptide
Forms of PR0874 may be recovered from culture medium or from host cell lysates.
If membrane-bound, it can be released from the membrane using a suitable detergent solution
(e.g. Triton-X 100) or by enzymatic cleavage. Cells employed in expression of PR0874
polypeptides can be disrupted by various physical or chemical means, such as freeze-thaw
cycling, sonication, mechanical disruption, or cell lysing agents.

It may be desired to purify PR0874 from recombinant cell proteins or polypeptides.
The following procedures are exemplary of suitable purification procedures: by fractionation
on an ion-exchange column; ethanol precipitation; reverse phase HPLC; chromatography on
silica or on a cation-exchange resin such as DEAE; chromatofocusing; SDS-PAGE;
ammonium sulfate precipitation; gel filtration using, for example, Sephadex G-75; protein A
Sepharose columns to remove contaminants such as IgG; and metal chelating columns to bind
epitope-tagged forms of the PR0874 polypeptide. Various methods of protein purification
may be employed and such methods are known in the art and described for example in
Deutscher, Methods in Enzymology, 182 (1990); Scopes, Protein Purification: Principles and
Practice, Springer-Verlag, New York (1982). The purification step(s) selected will depend,
for example, on the nature of the production process used and the particular PR0874
polypeptide produced.

E. Uses for PR0874

Nucleotide sequences (or their complement) encoding PR0874 polypeptides have
various applications in the art of molecular biology, including uses as hybridization probes,
in chromosome and gene mapping and in the generation of anti-sense RNA and DNA.
PR0874-encoding nucleic acid will also be useful for the preparation of PR0874 polypeptides
by the recombinant techniques described herein.

The DNA40621-1440 nucleotide sequence (SEQ ID NO:1) or the nucleotide sequence
encoding PR0874 (SEQ ID NO:2), or portions thereof, may be used as hybridization probes
for a cDNA library to isolate the full-length PR0874 gene or to isolate still other genes (for
instance, those encoding naturally-occurring variants of PR0874 or PR0874 from other
species) which have a desired sequence identity to the PR0874 nucleotide sequence disclosed
in Figure 1 (SEQ ID NO:1). Optionally, the length of the probes will be about 20 to about
50 bases. The hybridization probes may be derived from the UNQ441 (DNA40621-1440)
nucleotide sequence of SEQ ID NO:1 as shown in Figure 1 or from genomic sequences
including promoters, enhancer elements and introns of native sequence PR0874-encoding
DNA. By way of example, a screening method will comprise isolating the coding region of
the PR0874 gene using the known DNA sequence to synthesize a selected probe of about 40
bases. Hybridization probes may be labeled by a variety of labels, including radionucleotides
such as 32P or 35S, or enzymatic labels such as alkaline phosphatase coupled to the probe via
avidin/biotin coupling systems. Labeled probes having a sequence complementary to that of
the PR0874 gene of the present invention can be used to screen libraries of human cDNA,
genomic DNA or mRNA to determine which members of such libraries the probe hybridizes
to. Hybridization techniques are described in further detail in the Examples below.
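
Purely as an illustrative sketch, candidate probes of a chosen length that are complementary to a known DNA sequence can be enumerated as shown below. The function names `reverse_complement` and `candidate_probes`, the default length of 40 bases, and the short example sequence are assumptions for this sketch; the example is not the sequence of SEQ ID NO:1, and no selection for uniqueness, melting temperature, or secondary structure is attempted.

```python
from typing import List, Tuple

# Translation table for the four unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def candidate_probes(target: str, length: int = 40, step: int = 1) -> List[Tuple[int, str]]:
    """List (0-based start, probe) pairs, each probe complementary to a window of `target`.

    Returns an empty list when `target` is shorter than `length`.
    """
    if length <= 0 or step <= 0:
        raise ValueError("length and step must be positive")
    target = target.upper()
    return [
        (i, reverse_complement(target[i:i + length]))
        for i in range(0, len(target) - length + 1, step)
    ]


if __name__ == "__main__":
    # Short illustrative sequence and probe length only.
    for start, probe in candidate_probes("ATGCATTGCA", length=6, step=2):
        print(start, probe)
```

In practice the candidates would be further screened so that the selected probe is of sufficient length and sufficiently unambiguous to minimize false positives.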

The probes may also be employed in PCR techniques to generate a pool of sequences
for identification of closely related PR0874 sequences.

Nucleotide sequences encoding a PR0874 polypeptide can also be used to construct
hybridization probes for mapping the gene which encodes that PR0874 polypeptide and for
the genetic analysis of individuals with genetic disorders. The nucleotide sequences provided
herein may be mapped to a chromosome and specific regions of a chromosome using known
techniques, such as in situ hybridization, linkage analysis against known chromosomal
markers, and hybridization screening with libraries.

When the coding sequences for PR0874 encode a protein which binds to another
protein (for example, where the PR0874 polypeptide functions as a receptor), the PR0874
polypeptide can be used in assays to identify the other proteins or molecules involved in the
binding interaction. By such methods, inhibitors of the receptor/ligand binding interaction can
be identified. Proteins involved in such binding interactions can also be used to screen for
peptide or small molecule inhibitors or agonists of the binding interaction. Also, the receptor
PR0874 polypeptide can be used to isolate correlative ligand(s). Screening assays can be
designed to find lead compounds that mimic the biological activity of a native PR0874 or a
receptor for PR0874. Such screening assays will include assays amenable to high-throughput
screening of chemical libraries, making them particularly suitable for identifying small
molecule drug candidates. Small molecules contemplated include synthetic organic or
inorganic compounds. The assays can be performed in a variety of formats, including
protein-protein binding assays, biochemical screening assays, immunoassays and cell based assays,
which are well characterized in the art.

Nucleic acids which encode PRO874 polypeptide or any of its modified forms can also
be used to generate either transgenic animals or "knock out" animals which, in turn, are useful
in the development and screening of therapeutically useful reagents. A transgenic animal
(e.g., a mouse or rat) is an animal having cells that contain a transgene, which transgene was
introduced into the animal or an ancestor of the animal at a prenatal, e.g., an embryonic stage.
A transgene is a DNA which is integrated into the genome of a cell from which a transgenic
animal develops. In one embodiment, cDNA encoding PRO874 polypeptide can be used to
clone genomic DNA encoding PRO874 in accordance with established techniques and the
genomic sequences used to generate transgenic animals that contain cells which express DNA
encoding PRO874. Methods for generating transgenic animals, particularly animals such as
mice or rats, have become conventional in the art and are described, for example, in U.S.
Patent Nos. 4,736,866 and 4,870,009. Typically, particular cells would be targeted for
PRO874 transgene incorporation with tissue-specific enhancers. Transgenic animals that
include a copy of a transgene encoding PRO874 introduced into the germ line of the animal
at an embryonic stage can be used to examine the effect of increased expression of DNA
encoding PRO874. Such animals can be used as tester animals for reagents thought to confer
protection from, for example, pathological conditions associated with its overexpression. In
accordance with this facet of the invention, an animal is treated with the reagent and a reduced
incidence of the pathological condition, compared to untreated animals bearing the transgene,
would indicate a potential therapeutic intervention for the pathological condition.

Alternatively, non-human homologues of PRO874 can be used to construct a PRO874
"knock out" animal which has a defective or altered gene encoding PRO874 as a result of
homologous recombination between the endogenous gene encoding PRO874 and altered
genomic DNA encoding PRO874 introduced into an embryonic cell of the animal. For
example, cDNA encoding PRO874 can be used to clone genomic DNA encoding PRO874 in
accordance with established techniques. A portion of the genomic DNA encoding PRO874
can be deleted or replaced with another gene, such as a gene encoding a selectable marker
which can be used to monitor integration. Typically, several kilobases of unaltered flanking
DNA (both at the 5' and 3' ends) are included in the vector [see e.g., Thomas and Capecchi,
Cell, 51:503 (1987) for a description of homologous recombination vectors]. The vector is
introduced into an embryonic stem cell line (e.g., by electroporation) and cells in which the
introduced DNA has homologously recombined with the endogenous DNA are selected [see
e.g., Li et al., Cell, 69:915 (1992)]. The selected cells are then injected into a blastocyst of
an animal (e.g., a mouse or rat) to form aggregation chimeras [see e.g., Bradley, in
Teratocarcinomas and Embryonic Stem Cells: A Practical Approach, E. J. Robertson, ed.
(IRL, Oxford, 1987), pp. 113-152]. A chimeric embryo can then be implanted into a suitable
pseudopregnant female foster animal and the embryo brought to term to create a "knock out"
animal. Progeny harboring the homologously recombined DNA in their germ cells can be
identified by standard techniques and used to breed animals in which all cells of the animal
contain the homologously recombined DNA. Knockout animals can be characterized, for
instance, for their ability to defend against certain pathological conditions and for their
development of pathological conditions due to absence of the PRO874 polypeptide.

PRO874 polypeptides of the present invention may be employed both in vivo for
therapeutic purposes and in vitro. Those of ordinary skill in the art will well know how to
employ the PRO874 polypeptides of the present invention for such purposes.

F. Anti-PRO874 Antibodies
The present invention further provides anti-PRO874 polypeptide antibodies.
Exemplary antibodies include polyclonal, monoclonal, humanized, bispecific, and
heteroconjugate antibodies.

1. Polyclonal Antibodies
The anti-PRO874 antibodies of the present invention may comprise polyclonal
antibodies. Methods of preparing polyclonal antibodies are known to the skilled artisan.
Polyclonal antibodies can be raised in a mammal, for example, by one or more injections of
an immunizing agent and, if desired, an adjuvant. Typically, the immunizing agent and/or
adjuvant will be injected in the mammal by multiple subcutaneous or intraperitoneal injections.
The immunizing agent may include the PRO874 polypeptide or a fusion protein thereof. It
may be useful to conjugate the immunizing agent to a protein known to be immunogenic in the
mammal being immunized. Examples of such immunogenic proteins include but are not
limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean
trypsin inhibitor. Examples of adjuvants which may be employed include Freund's complete
adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose
dicorynomycolate). The immunization protocol may be selected by one skilled in the art
without undue experimentation.

2. Monoclonal Antibodies
The anti-PRO874 antibodies may, alternatively, be monoclonal antibodies. Monoclonal
antibodies may be prepared using hybridoma methods, such as those described by Kohler and
Milstein, Nature, 256:495 (1975). In a hybridoma method, a mouse, hamster, or other
appropriate host animal, is typically immunized with an immunizing agent to elicit
lymphocytes that produce or are capable of producing antibodies that will specifically bind to
the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro.

The immunizing agent will typically include the PRO874 polypeptide or a fusion
protein thereof. Generally, either peripheral blood lymphocytes ("PBLs") are used if cells of
human origin are desired, or spleen cells or lymph node cells are used if non-human
mammalian sources are desired. The lymphocytes are then fused with an immortalized cell
line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell
[Goding, Monoclonal Antibodies: Principles and Practice, Academic Press, (1986) pp. 59-
103]. Immortalized cell lines are usually transformed mammalian cells, particularly myeloma
cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are
employed. The hybridoma cells may be cultured in a suitable culture medium that preferably
contains one or more substances that inhibit the growth or survival of the unfused,
immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine
phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas
typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which
substances prevent the growth of HGPRT-deficient cells.

Preferred immortalized cell lines are those that fuse efficiently, support stable high
level expression of antibody by the selected antibody-producing cells, and are sensitive to a
medium such as HAT medium. More preferred immortalized cell lines are murine myeloma
lines, which can be obtained, for instance, from the Salk Institute Cell Distribution Center, San
Diego, California and the American Type Culture Collection, Rockville, Maryland. Human
myeloma and mouse-human heteromyeloma cell lines also have been described for the
production of human monoclonal antibodies [Kozbor, J. Immunol., 133:3001 (1984); Brodeur
et al., Monoclonal Antibody Production Techniques and Applications, Marcel Dekker, Inc.,
New York, (1987) pp. 51-63].

The culture medium in which the hybridoma cells are cultured can then be assayed for
the presence of monoclonal antibodies directed against a PRO874 polypeptide. Preferably, the
binding specificity of monoclonal antibodies produced by the hybridoma cells is determined
by immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or
enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in
the art. The binding affinity of the monoclonal antibody can, for example, be determined by
the Scatchard analysis of Munson and Pollard, Anal. Biochem., 107:220 (1980).
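As an illustration of the Scatchard analysis mentioned above, the following Python sketch fits the usual one-site linearization, B/F = (Bmax - B)/Kd, by ordinary least squares. It is a minimal sketch, not the procedure of the cited reference: the function name scatchard_fit, the one-site binding model, and the synthetic data points are assumptions introduced here for illustration only.

```python
def scatchard_fit(bound, free):
    """Fit B/F against B with a least-squares line and return (Kd, Bmax).

    Assumes a single class of binding sites, so that
    B/F = (Bmax - B) / Kd, i.e. slope = -1/Kd and intercept = Bmax/Kd.
    """
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope
    bmax = intercept * kd
    return kd, bmax


# Synthetic (hypothetical) data generated from B = Bmax * F / (Kd + F)
# with Kd = 2.0 and Bmax = 10.0, in arbitrary concentration units.
free = [0.5, 1.0, 2.0, 4.0, 8.0]
bound = [10.0 * f / (2.0 + f) for f in free]
kd, bmax = scatchard_fit(bound, free)
```

With noise-free synthetic data the fit recovers the parameters used to generate it; real binding data would scatter around the line, and a curved plot would suggest that the one-site assumption does not hold.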

After the desired hybridoma cells are identified, the clones may be subcloned by
limiting dilution procedures and grown by standard methods [Goding, supra]. Suitable culture
media for this purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMI-
1640 medium. Alternatively, the hybridoma cells may be grown in vivo as ascites in a
mammal.

The monoclonal antibodies secreted by the subclones may be isolated or purified from
the culture medium or ascites fluid by conventional immunoglobulin purification procedures
such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel
electrophoresis, dialysis, or affinity chromatography.

The monoclonal antibodies may also be made by recombinant DNA methods, such as
those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of
the invention can be readily isolated and sequenced using conventional procedures (e.g., by
using oligonucleotide probes that are capable of binding specifically to genes encoding the
heavy and light chains of murine antibodies). The hybridoma cells of the invention serve as
a preferred source of such DNA. Once isolated, the DNA may be placed into expression
vectors, which are then transfected into host cells such as simian COS cells, Chinese hamster
ovary (CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein,
to obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA also
may be modified, for example, by substituting the coding sequence for human heavy and light
chain constant domains in place of the homologous murine sequences [U.S. Patent No.
4,816,567; Morrison et al., Proc. Natl. Acad. Sci. USA, 81:6851-6855 (1984)] or by
covalently joining to the immunoglobulin coding sequence all or part of the coding sequence
for a non-immunoglobulin polypeptide. Such a non-immunoglobulin polypeptide can be
substituted for the constant domains of an antibody of the invention, or can be substituted for
the variable domains of one antigen-combining site of an antibody of the invention to create
a chimeric bivalent antibody.

The antibodies may be monovalent antibodies. Methods for preparing monovalent
antibodies are well known in the art. For example, one method involves recombinant
expression of immunoglobulin light chain and modified heavy chain. The heavy chain is
truncated generally at any point in the Fc region so as to prevent heavy chain crosslinking.
Alternatively, the relevant cysteine residues are substituted with another amino acid residue
or are deleted so as to prevent crosslinking.

In vitro methods are also suitable for preparing monovalent antibodies. Digestion of
antibodies to produce fragments thereof, particularly, Fab fragments, can be accomplished
using routine techniques known in the art.



3. Humanized Antibodies

The anti-PRO874 antibodies of the invention may further comprise humanized
antibodies or human antibodies. Humanized forms of non-human (e.g., murine) antibodies
are chimeric immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab,
Fab', F(ab')2 or other antigen-binding subsequences of antibodies) which contain minimal
sequence derived from non-human immunoglobulin. Humanized antibodies include human
immunoglobulins (recipient antibody) in which residues from a complementary determining
region (CDR) of the recipient are replaced by residues from a CDR of a non-human species
(donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and
capacity. In some instances, Fv framework residues of the human immunoglobulin are
replaced by corresponding non-human residues. Humanized antibodies may also comprise
residues which are found neither in the recipient antibody nor in the imported CDR or
framework sequences. In general, the humanized antibody will comprise substantially all of
at least one, and typically two, variable domains, in which all or substantially all of the CDR
regions correspond to those of a non-human immunoglobulin and all or substantially all of the
FR regions are those of a human immunoglobulin consensus sequence. The humanized
antibody optimally also will comprise at least a portion of an immunoglobulin constant region
(Fc), typically that of a human immunoglobulin [Jones et al., Nature, 321:522-525 (1986);
Riechmann et al., Nature, 332:323-329 (1988); and Presta, Curr. Op. Struct. Biol., 2:593-596
(1992)].

Methods for humanizing non-human antibodies are well known in the art. Generally,
a humanized antibody has one or more amino acid residues introduced into it from a source
which is non-human. These non-human amino acid residues are often referred to as "import"
residues, which are typically taken from an "import" variable domain. Humanization can be
essentially performed following the method of Winter and co-workers [Jones et al., Nature,
321:522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et al.,
Science, 239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the
corresponding sequences of a human antibody. Accordingly, such "humanized" antibodies
are chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact
human variable domain has been substituted by the corresponding sequence from a non-human
species. In practice, humanized antibodies are typically human antibodies in which some CDR
residues and possibly some FR residues are substituted by residues from analogous sites in
rodent antibodies.

Human antibodies can also be produced using various techniques known in the art,
including phage display libraries [Hoogenboom and Winter, J. Mol. Biol., 227:381 (1991);
Marks et al., J. Mol. Biol., 222:581 (1991)]. The techniques of Cole et al. and Boerner et al.
are also available for the preparation of human monoclonal antibodies [Cole et al., Monoclonal
Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985) and Boerner et al., J. Immunol.,
147(1):86-95 (1991)].

4. Bispecific Antibodies
Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that
have binding specificities for at least two different antigens. In the present case, one of the
binding specificities is for a PRO874 polypeptide, the other one is for any other antigen, and
preferably for a cell-surface protein or receptor or receptor subunit.

Methods for making bispecific antibodies are known in the art. Traditionally, the
recombinant production of bispecific antibodies is based on the co-expression of two
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different
specificities [Milstein and Cuello, Nature, 305:537-539 (1983)]. Because of the random
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce
a potential mixture of ten different antibody molecules, of which only one has the correct
bispecific structure. The purification of the correct molecule is usually accomplished by
affinity chromatography steps. Similar procedures are disclosed in WO 93/08829, published
13 May 1993, and in Traunecker et al., EMBO J., 10:3655-3659 (1991).
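The count of ten molecules follows from simple combinatorics, which the short Python sketch below enumerates. It assumes, for illustration only, that each antibody consists of two heavy-chain/light-chain halves, that any heavy chain can pair with any light chain, and that the two halves are interchangeable; the labels H1, H2, L1 and L2 are hypothetical names for the chains of the two parental antibodies.

```python
from itertools import combinations_with_replacement, product

# Hypothetical chain labels: antibody 1 contributes H1/L1, antibody 2 H2/L2.
heavy_chains = ["H1", "H2"]
light_chains = ["L1", "L2"]

# Each half-molecule is one heavy chain paired with one light chain (4 kinds).
half_molecules = list(product(heavy_chains, light_chains))

# A full antibody is an unordered pair of half-molecules (repeats allowed).
antibodies = list(combinations_with_replacement(half_molecules, 2))

# Only one combination carries both correctly paired binding sites.
correct = {("H1", "L1"), ("H2", "L2")}
bispecific = [ab for ab in antibodies if set(ab) == correct]

print(len(antibodies), "possible molecules,", len(bispecific), "correct bispecific")
```

Four kinds of half-molecule give 4 homodimeric and 6 heterodimeric combinations, which is why the desired bispecific species has to be separated from nine unwanted ones.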

Antibody variable domains with the desired binding specificities (antibody-antigen
combining sites) can be fused to immunoglobulin constant domain sequences. The fusion
preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part
of the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant
region (CH1) containing the site necessary for light-chain binding present in at least one of the
fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the
immunoglobulin light chain, are inserted into separate expression vectors, and are
co-transfected into a suitable host organism. For further details of generating bispecific
antibodies see, for example, Suresh et al., Methods in Enzymology, 121:210 (1986).

5. Heteroconjugate Antibodies

Heteroconjugate antibodies are also within the scope of the present invention.
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies
have, for example, been proposed to target immune system cells to unwanted cells [U.S.
Patent No. 4,676,980], and for treatment of HIV infection [WO 91/00360; WO 92/200373;
EP 03089]. It is contemplated that the antibodies may be prepared in vitro using known
methods in synthetic protein chemistry, including those involving crosslinking agents. For
example, immunotoxins may be constructed using a disulfide exchange reaction or by forming
a thioether bond. Examples of suitable reagents for this purpose include iminothiolate and
methyl-4-mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No.
4,676,980.

G. Uses for anti-PRO874 Antibodies

The anti-PRO874 antibodies of the present invention have various utilities. For
example, anti-PRO874 antibodies may be used in diagnostic assays for PRO874 polypeptides,
e.g., detecting expression in specific cells, tissues, or serum. Various diagnostic assay
techniques known in the art may be used, such as competitive binding assays, direct or indirect
sandwich assays and immunoprecipitation assays conducted in either heterogeneous or
homogeneous phases [Zola, Monoclonal Antibodies: A Manual of Techniques, CRC Press,
Inc. (1987) pp. 147-158]. The antibodies used in the diagnostic assays can be labeled with a
detectable moiety. The detectable moiety should be capable of producing, either directly or
indirectly, a detectable signal. For example, the detectable moiety may be a radioisotope, such
as 3H, 14C, 35S, or 125I, a fluorescent or chemiluminescent compound, such as fluorescein
isothiocyanate, rhodamine, or luciferin, or an enzyme, such as alkaline phosphatase, beta-
galactosidase or horseradish peroxidase. Any method known in the art for conjugating the
antibody to the detectable moiety may be employed, including those methods described by
Hunter et al., Nature, 144:945 (1962); David et al., Biochemistry, 13:1014 (1974); Pain et
al., J. Immunol. Meth., 40:219 (1981); and Nygren, J. Histochem. and Cytochem., 30:407
(1982).

Anti-PRO874 antibodies also are useful for the affinity purification of PRO874
polypeptides from recombinant cell culture or natural sources. In this process, the antibodies
against a PRO874 polypeptide are immobilized on a suitable support, such as a Sephadex resin
or filter paper, using methods well known in the art. The immobilized antibody then is
contacted with a sample containing the PRO874 polypeptide to be purified, and thereafter the
support is washed with a suitable solvent that will remove substantially all the material in the
sample except the PRO874 polypeptide, which is bound to the immobilized antibody. Finally,
the support is washed with another suitable solvent that will release the PRO874 polypeptide
from the antibody.

The following examples are offered for illustrative purposes only, and are not intended 
to limit the scope of the present invention in any way. 

All patent and literature references cited in the present specification are hereby 
incorporated by reference in their entirety. 
EXAMPLES

Commercially available reagents referred to in the examples were used according to
manufacturer's instructions unless otherwise indicated. The source of those cells identified
in the following examples, and throughout the specification, by ATCC accession numbers is
the American Type Culture Collection, Rockville, Maryland.

EXAMPLE 1: Isolation of cDNA Clones Encoding Human PRO874

The extracellular domain (ECD) sequences (including the secretion signal, if any)
from about 950 known secreted proteins from the Swiss-Prot public protein database were used
to search expressed sequence tag (EST) databases. The EST databases included public EST
databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte
Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program
BLAST or BLAST2 (Altshul et al., Methods in Enzymology, 266:460-480 (1996)) as a
comparison of the ECD protein sequences to a 6 frame translation of the EST sequence.
Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did
not encode known proteins were clustered and assembled into consensus DNA sequences with
the program "phrap" (Phil Green, University of Washington, Seattle, Washington;
http://bozeman.mbt.washington.edu/phrap.docs/phrap.html).
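The "6 frame translation" used in the search above can be illustrated with a short Python sketch. This is not the BLAST implementation; it only shows, under the assumption of the standard genetic code, how one DNA sequence yields six candidate protein sequences (three reading frames on each strand). The function names and the short test sequence are hypothetical.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate complete codons; unknown codons (e.g. with N) become 'X'."""
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


def six_frame_translation(dna):
    """Map frame labels +1..+3 and -1..-3 to their translations."""
    dna = dna.upper()
    rc = reverse_complement(dna)
    frames = {}
    for offset in range(3):
        frames[offset + 1] = translate(dna[offset:])
        frames[-(offset + 1)] = translate(rc[offset:])
    return frames


frames = six_frame_translation("ATGGCCTAA")  # hypothetical toy sequence
```

A protein query is then compared against all six translations, since the reading frame and strand of an EST are not known in advance.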

A consensus DNA sequence was identified using the above described analysis
as encoding a potential secreted protein. The consensus sequence, designated herein and in
Figure 3 as DNA36459.initsequence, was then extended using repeated cycles of BLAST and
phrap to extend the sequence as far as possible using the sources of EST sequences discussed
above. The extended assembly sequence is referred to in the alignment figure shown in Figure
3, as "consen01", and is also referred to herein as "DNA36459". The double-stranded
sequence of DNA36459 (SEQ ID NO:4) is shown in Figure 4.

Based on the DNA36459 consensus sequence shown in Figure 4, oligonucleotides were
synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and
2) for use as probes to isolate a clone of the coding sequence for PRO874. Forward and
reverse PCR primers generally range from 20 to 30 nucleotides and are often designed to give
a PCR product of about 100-1000 bp in length. The probe sequences are typically 40-55 bp
in length. In some cases, additional oligonucleotides are synthesized when the consensus
sequence is greater than about 1-1.5 kbp. In order to screen several libraries for a clone, DNA
from the libraries was screened by PCR amplification, as per Ausubel et al., Current Protocols
in Molecular Biology, with the PCR primer pair. A positive library was then used to isolate
clones encoding the gene of interest using the probe oligonucleotide and one of the primer
pairs.

PCR primers (forward and reverse) were synthesized:

forward PCR primer (36459.f1) 5'-TCGTGCCCAGGGGCTGATGTGC-3' (SEQ ID NO:5);

and

reverse PCR primer (36459.r1) 5'-GTCTTTACCCAGCCCCGGGATGCG-3' (SEQ ID NO:6).

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the
consensus DNA36459 sequence which had the following nucleotide sequence:
hybridization probe (36459.p1)

5'-GGCCTAATCCARCGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGG-3' (SEQ ID
NO:7).
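The oligonucleotides listed above can be checked against the stated design ranges (primers of 20 to 30 nucleotides, probes of 40-55 bases) with a few lines of Python. The sequences are copied as printed in this text; the helper names, the GC-fraction calculation, and the Wallace-rule melting temperature estimate (2 °C per A/T, 4 °C per G/C) are assumptions added for illustration and are not part of the described method.

```python
# Sequences as printed above (5' to 3'); the probe is reproduced verbatim,
# including the "R" character that appears in the printed text.
FORWARD_PRIMER = "TCGTGCCCAGGGGCTGATGTGC"
REVERSE_PRIMER = "GTCTTTACCCAGCCCCGGGATGCG"
PROBE = "GGCCTAATCCARCGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGG"


def gc_fraction(seq):
    """Fraction of G and C bases in an oligonucleotide."""
    return sum(seq.count(base) for base in "GC") / len(seq)


def wallace_tm(seq):
    """Rough melting temperature estimate in deg C (Wallace rule).

    The rule is a rule of thumb intended for short oligonucleotides and is
    used here only as an illustrative assumption.
    """
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def in_range(seq, low, high):
    """True if the oligonucleotide length lies within [low, high]."""
    return low <= len(seq) <= high


for name, seq in (("forward", FORWARD_PRIMER), ("reverse", REVERSE_PRIMER)):
    print(name, len(seq), round(gc_fraction(seq), 2), wallace_tm(seq),
          in_range(seq, 20, 30))
print("probe", len(PROBE), in_range(PROBE, 40, 55))
```

Both primers and the probe fall inside the length ranges given in the preceding paragraph; the GC and melting temperature figures are only coarse indicators and would normally be refined with a dedicated primer design tool.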

In order to screen several libraries for a source of a clone, DNA from the libraries was
screened by PCR amplification with the PCR primer pair identified above. A positive library
was then used to isolate clones encoding the PRO874 gene using the probe oligonucleotide and
one of the PCR primers.

RNA for construction of the cDNA libraries was isolated from human fetal lung tissue
(LIB25). The cDNA libraries used to isolate the cDNA clones were constructed by standard
methods using commercially available reagents such as those from Invitrogen, San Diego, CA.
The cDNA was primed with oligo dT containing a NotI site, linked with blunt to SalI
hemikinased adaptors, cleaved with NotI, sized appropriately by gel electrophoresis, and
cloned in a defined orientation into a suitable cloning vector (such as pRK5B or pRK5D; pRK5B
is a precursor of pRK5D that does not contain the SfiI site; see, Holmes et al., Science,
253:1278-1280 (1991)) in the unique XhoI and NotI sites.

DNA sequencing of the clones isolated as described above gave the DNA sequence for
PRO874 [herein designated as UNQ441 (DNA40621-1440)] (SEQ ID NO:1) and the derived
protein sequence for PRO874.

The entire nucleotide sequence of UNQ441 (DNA40621-1440) is shown in Figure 1
(SEQ ID NO:1). Clone UNQ441 (DNA40621-1440) contains a single open reading frame
ending at the stop codon at nucleotide positions 964-966 (Figure 1). The predicted polypeptide
encoded by DNA40621-1440 is 321 amino acids long (Figure 2). The PRO874 protein shown
in Figure 2 has an estimated molecular weight of about 36,194 daltons and a pI of about 9.85.
Analysis of the PRO874 sequence shown in Figure 2 (SEQ ID NO:3) evidenced the presence
of the following: a type II transmembrane domain at about amino acids 57-80; additional
transmembrane domains at about amino acids 110-126, 215-231, and 254-274; potential N-
glycosylation sites at about amino acids 16-19, 27-30, and 289-292; sequence identity with
hypothetical YBR002c family proteins at about amino acids 276-287; and sequence identity
with ammonium transporter proteins at about amino acids 204-230.
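The figures quoted above can be cross-checked with simple arithmetic, sketched below in Python. The calculation takes the stated values at face value (a stop codon beginning at nucleotide 964, 321 residues, about 36,194 daltons) and assumes that the stop codon immediately follows the last sense codon; the function name is hypothetical.

```python
def coding_start(stop_codon_first_nt, n_residues):
    """First nucleotide of codon 1, assuming the stop codon directly
    follows the codon for the last residue (3 nucleotides per residue)."""
    return stop_codon_first_nt - 3 * n_residues


start = coding_start(964, 321)       # 964 - 963
mean_residue_mass = 36194 / 321      # daltons per residue

print("coding region would span nucleotides", start, "to", 964 - 1)
print("mean residue mass: %.1f Da" % mean_residue_mass)
```

Under these assumptions the 321 codons occupy 963 nucleotides ending just before the stop codon, and the mean residue mass of roughly 113 Da is in the range typical of proteins, so the stated length and molecular weight are mutually consistent.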

Clone UNQ441 (DNA40621-1440) was deposited with the ATCC on June 2, 1998, and
is assigned ATCC deposit no. ____.

Analysis of the amino acid sequence of the PRO874 polypeptide suggests that it is a
novel multi-span transmembrane protein. However, an analysis of the Dayhoff database
(version 35.45 SwissProt 35) evidenced sequence identity between the PRO874 amino acid
sequence and the following Dayhoff sequences: S67049, AF054839_1, S73437, S52460, and
HIVU80570_1.
EXAMPLE 2: Use of PRO874-Encoding DNA as a Hybridization Probe

The following method describes use of a nucleotide sequence encoding PRO874 as a
hybridization probe.

DNA comprising the coding sequence of PRO874 (as shown in Figure 1, SEQ ID
NO:1) or a fragment thereof is employed as a probe to screen for homologous DNAs (such
as those encoding naturally-occurring variants of PRO874) in human tissue cDNA libraries or
human tissue genomic libraries.

Hybridization and washing of filters containing either library DNAs is performed under
the following high stringency conditions. Hybridization of radiolabeled PRO874 polypeptide-
derived probe to the filters is performed in a solution of 50% formamide, 5x SSC, 0.1% SDS,
0.1% sodium pyrophosphate, 50 mM sodium phosphate, pH 6.8, 2x Denhardt's solution, and
10% dextran sulfate at 42°C for 20 hours. Washing of the filters is performed in an aqueous
solution of 0.1x SSC and 0.1% SDS at 42°C.
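By way of illustration, the wash solution described above can be made up from concentrated stocks using the usual dilution relation C1 x V1 = C2 x V2. The Python sketch below assumes 20x SSC and 10% SDS stocks and a 1000 ml batch; these stock concentrations, the batch size, and the function name are assumptions chosen for the example and are not specified in this text.

```python
def stock_volumes(final_volume_ml, recipe):
    """Return the volume of each stock (ml) plus water to reach final volume.

    `recipe` maps a component name to (final_concentration, stock_concentration),
    both in the same units, so that V_stock = V_final * C_final / C_stock.
    """
    volumes = {
        name: final_volume_ml * final / stock
        for name, (final, stock) in recipe.items()
    }
    water = final_volume_ml - sum(volumes.values())
    if water < 0:
        raise ValueError("stocks are too dilute for the requested final volume")
    volumes["water"] = water
    return volumes


# Wash solution: 0.1x SSC and 0.1% SDS, from assumed 20x SSC and 10% SDS stocks.
wash = stock_volumes(1000.0, {"SSC": (0.1, 20.0), "SDS": (0.1, 10.0)})
for component, volume in wash.items():
    print("%-5s %7.1f ml" % (component, volume))
```

The same function can be applied to the hybridization solution, provided the chosen stocks are concentrated enough for all components to fit within the final volume.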

DNAs having a desired sequence identity with the DNA encoding the PRO874
polypeptide can then be identified using standard techniques known in the art.

EXAMPLE 3: Expression of PRO874 Polypeptides in E. coli

This example illustrates the preparation of unglycosylated forms of PRO874
polypeptides by recombinant expression in E. coli.

The DNA sequence encoding the PRO874 polypeptide or a fragment or variant thereof
is initially amplified using selected PCR primers. The primers should contain restriction
enzyme sites which correspond to the restriction enzyme sites on the selected expression
vector. A variety of expression vectors may be employed. An example of a suitable vector
is pBR322 (derived from E. coli; see Bolivar et al., Gene, 2:95 (1977)) which contains genes
for ampicillin and tetracycline resistance. The vector is digested with restriction enzyme and
dephosphorylated. The PCR amplified sequences are then ligated into the vector. The vector
will preferably include sequences which encode for an antibiotic resistance gene, a trp
promoter, a polyhis leader (including the first six STII codons, polyhis sequence, and
enterokinase cleavage site), the PRO874 coding region, lambda transcriptional terminator, and
an argU gene.

The ligation mixture is then used to transform a selected E. coli strain using the
methods described in Sambrook et al., supra. Transformants are identified by their ability to
grow on LB plates and antibiotic resistant colonies are then selected. Plasmid DNA can be
isolated and confirmed by restriction analysis and DNA sequencing.

Selected clones can be grown overnight in liquid culture medium such as LB broth
supplemented with antibiotics. The overnight culture may subsequently be used to inoculate
a larger scale culture. The cells are then grown to a desired optical density, during which the
expression promoter is turned on.

After culturing the cells for several more hours, the cells can be harvested by
centrifugation. The cell pellet obtained by the centrifugation can be solubilized using various
agents known in the art, and the solubilized PRO874 polypeptide can then be purified using
a metal chelating column under conditions that allow tight binding of the polypeptide.

EXAMPLE 4: Expression of PRO874 Polypeptides in Mammalian Cells
This example illustrates preparation of glycosylated forms of PRO874 polypeptides by
recombinant expression in mammalian cells.

The vector, pRK5 (see EP 307,247, published March 15, 1989), is employed as the
expression vector. Optionally, the PRO874-encoding DNA is ligated into pRK5 with selected
restriction enzymes to allow insertion of the PRO874-encoding DNA using ligation methods
such as described in Sambrook et al., supra. The resulting vector is called pRK5-PRO874.
~ In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC 

i|3 CCL 1573) are grown to confhience in tissue culture plates in medium such as DMEM 

0 

20 supplemented with fetal calf serum and optionally, nutrient components and/or antibiotics. 
About 10 ng pRK5-PR0874 DNA is mixed with about 1 fig DNA encoding the VA RNA gene 
[Thimmappaya et al., Cell, 21:543 (1982)] and dissolved in 500 (d of 1 mM Tris-HCI, 0.1 
raM EDTA, 0.227 M CaCl^. To this mixture is added, dropwise, 500 nl of 50 mM HEPES 
(pH 7.35), 280 mM NaCl, 1.5 mM NaPOi, and a precipitate is allowed to form for 10 minutes 

25 at 25*C, The precipitate is suspended and added to the 293 cells and allowed to settle for 
about four hours at 37'C. The culture medium is aspirated off and 2 ml of 20% glycerol in 
PBS is added for 30 seconds. The 293 cells are then washed with serum free mediimi, fresh 
medium is added and the cells are incubated for about 5 days. 
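
As a rough check on the calcium phosphate step above, the following sketch computes the concentrations that result when the DNA/calcium solution and the HEPES-buffered saline are combined. It assumes that both volumes are 500 microliters and that the calcium salt is CaCl2; the function name `mix_concentrations` and the data layout are illustrative assumptions and are not part of the described method.

```python
# Illustrative sketch only: final concentrations after pooling two solutions.
# Assumption: both solutions are 500 microliters, as read from the text above.

def mix_concentrations(solutions):
    """Return final concentrations (mM) after pooling the given solutions.

    `solutions` is a list of (volume_ul, {component: concentration_mM}) pairs.
    Each component is diluted by its own volume over the pooled volume.
    """
    total_ul = sum(volume for volume, _ in solutions)
    final = {}
    for volume, components in solutions:
        for name, conc_mM in components.items():
            final[name] = final.get(name, 0.0) + conc_mM * volume / total_ul
    return final


# DNA solution: 1 mM Tris-HCl, 0.1 mM EDTA, 0.227 M (227 mM) CaCl2
dna_solution = (500.0, {"Tris-HCl": 1.0, "EDTA": 0.1, "CaCl2": 227.0})
# Buffered saline: 50 mM HEPES, 280 mM NaCl, 1.5 mM NaPO4
hepes_solution = (500.0, {"HEPES": 50.0, "NaCl": 280.0, "NaPO4": 1.5})

final_mM = mix_concentrations([dna_solution, hepes_solution])
for name, conc in final_mM.items():
    print(f"{name}: {conc:g} mM")
```

Because the two volumes are equal, every component ends up at half of its starting concentration; the helper is written for arbitrary volumes so that the same arithmetic can be reused if the volumes differ.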

Approximately 24 hours after the transfections, the culture medium is removed and
replaced with culture medium (alone) or culture medium containing 200 μCi/ml 35S-cysteine
and 200 μCi/ml 35S-methionine. After a 12 hour incubation, the conditioned medium is
collected, concentrated on a spin filter, and loaded onto a 15% SDS gel. The processed gel
may be dried and exposed to film for a selected period of time to reveal the presence of
PRO874 polypeptide. The cultures containing transfected cells may undergo further incubation
(in serum free medium) and the medium is tested in selected bioassays.

In an alternative technique, PRO874-encoding DNA may be introduced into 293 cells
transiently using the dextran sulfate method described by Sompayrac et al., Proc. Natl. Acad.
Sci., 12:7575 (1981). 293 cells are grown to maximal density in a spinner flask and 700 μg
pRK5-PRO874 DNA is added. The cells are first concentrated from the spinner flask by
centrifugation and washed with PBS. The DNA-dextran precipitate is incubated on the cell
pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed with
tissue culture medium, and re-introduced into the spinner flask containing tissue culture
medium, 5 μg/ml bovine insulin and 0.1 μg/ml bovine transferrin. After about four days, the
conditioned media is centrifuged and filtered to remove cells and debris. The sample
containing expressed PRO874 polypeptide can then be concentrated and purified by any
selected method, such as dialysis and/or column chromatography.

In another embodiment, PRO874 polypeptide can be expressed in CHO cells. The
pRK5-PRO874 vector can be transfected into CHO cells using known reagents such as CaPO4
or DEAE-dextran. As described above, the cell cultures can be incubated, and the medium
replaced with culture medium (alone) or medium containing a radiolabel such as
35S-methionine. After determining the presence of PRO874 polypeptide, the culture medium may
be replaced with serum free medium. Preferably, the cultures are incubated for about 6 days,
and then the conditioned medium is harvested. The medium containing the expressed PRO874
polypeptide can then be concentrated and purified by any selected method.

Epitope-tagged PRO874 polypeptide may also be expressed in host CHO cells. The
PRO874-encoding DNA may be subcloned out of the pRK5 vector. The subclone insert can
undergo PCR to fuse in frame with a selected epitope tag such as a poly-his tag into a
Baculovirus expression vector. The poly-his tagged PRO874-encoding DNA insert can then
be subcloned into a SV40 driven vector containing a selection marker such as DHFR for
selection of stable clones. Finally, the CHO cells can be transfected (as described above) with
the SV40 driven vector. Labeling may be performed, as described above, to verify
expression. The culture medium containing the expressed poly-His tagged PRO874
polypeptide can then be concentrated and purified by any selected method, such as by
Ni2+-chelate affinity chromatography.



EXAMPLE 5: Expression of a PRO874 Polypeptide in Yeast

The following method describes recombinant expression of PRO874 polypeptides in
yeast.

First, yeast expression vectors are constructed for intracellular production or secretion
of PRO874 polypeptide from the ADH2/GAPDH promoter. DNA encoding the PRO874
polypeptide of interest, a selected signal peptide and the promoter is inserted into suitable
restriction enzyme sites in the selected plasmid to direct intracellular expression of the PRO874
polypeptide. For secretion, DNA encoding the PRO874 polypeptide can be cloned into the
selected plasmid, together with DNA encoding the ADH2/GAPDH promoter, the yeast alpha-factor
secretory signal/leader sequence, and linker sequences (if needed) for expression of the
PRO874 polypeptide.

Yeast cells, such as yeast strain AB110, can then be transformed with the expression
plasmids described above and cultured in selected fermentation media. The transformed yeast
supernatants can be analyzed by precipitation with 10% trichloroacetic acid and separation by
SDS-PAGE, followed by staining of the gels with Coomassie Blue stain.

Recombinant PRO874 polypeptide can subsequently be isolated and purified by
removing the yeast cells from the fermentation medium by centrifugation and then
concentrating the medium using selected cartridge filters. The concentrate containing the
PRO874 polypeptide may further be purified using selected column chromatography resins.



EXAMPLE 6: Expression of PRO874 Polypeptides in Baculovirus-Infected Insect Cells

The following method describes recombinant expression of PRO874 polypeptides in
Baculovirus-infected insect cells.

The PRO874-encoding DNA is fused upstream of an epitope tag contained within a
baculovirus expression vector. Such epitope tags include poly-his tags and immunoglobulin
tags (like Fc regions of IgG). A variety of plasmids may be employed, including plasmids
derived from commercially available plasmids such as pVL1393 (Novagen). Briefly, the
PRO874-encoding DNA or the desired portion of the PRO874-encoding DNA (such as the
sequence encoding the extracellular domain of a transmembrane protein) is amplified by PCR
with primers complementary to the 5' and 3' regions. The 5' primer may incorporate flanking
(selected) restriction enzyme sites. The product is then digested with those selected restriction
enzymes and subcloned into the expression vector.
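
The primer step above can be sketched in a few lines of code. This is a minimal illustration only: the template is a short hypothetical placeholder (not the PRO874 sequence), the 18-base primer length is an arbitrary choice, and the EcoRI recognition site GAATTC stands in for whatever restriction site is actually selected. The function names are likewise assumptions made for the example.

```python
# Illustrative sketch: build a 5' primer carrying a restriction site and a
# 3' primer as the reverse complement of the end of the coding region.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def design_primers(template, primer_len=18, five_prime_site="GAATTC"):
    """Return (forward, reverse) primers for the given template.

    The forward primer is the first `primer_len` bases of the template with
    the restriction site prepended; the reverse primer anneals to the last
    `primer_len` bases and is therefore their reverse complement.
    """
    if len(template) < primer_len:
        raise ValueError("template is shorter than the requested primer length")
    forward = five_prime_site + template[:primer_len]
    reverse = reverse_complement(template[-primer_len:])
    return forward, reverse


# Hypothetical 42-base template used only to exercise the functions.
template = "ATGGCTAGCAAGCTTGGATCCGTACCTGAAGGTCTGCATTAA"
forward, reverse = design_primers(template)
print("5' primer:", forward)
print("3' primer:", reverse)
```

A real design would also check melting temperature and confirm that the chosen site does not occur inside the insert; those checks are omitted here.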

Recombinant baculovirus is generated by co-transfecting the above plasmid and
BaculoGold™ virus DNA (Pharmingen) into Spodoptera frugiperda ("Sf9") cells (ATCC CRL
1711) using lipofectin (commercially available from GIBCO-BRL). After 4 to 5 days of
incubation at 28°C, the released viruses are harvested and used for further amplifications.
Viral infection and protein expression is performed as described by O'Reilley et al.,
Baculovirus expression vectors: A Laboratory Manual, Oxford: Oxford University Press (1994).

Expressed poly-his tagged PRO874 polypeptide can then be purified, for example, by
Ni2+-chelate affinity chromatography as follows. Extracts are prepared from recombinant
virus-infected Sf9 cells as described by Rupert et al., Nature, 362:175-179 (1993). Briefly,
Sf9 cells are washed, resuspended in sonication buffer (25 mL Hepes, pH 7.9; 12.5 mM
MgCl2; 0.1 mM EDTA; 10% Glycerol; 0.1% NP-40; 0.4 M KCl), and sonicated twice for 20
seconds on ice. The sonicates are cleared by centrifugation, and the supernatant is diluted 50-fold
in loading buffer (50 mM phosphate, 300 mM NaCl, 10% Glycerol, pH 7.8) and filtered
through a 0.45 μm filter. A Ni2+-NTA agarose column (commercially available from Qiagen)
is prepared with a bed volume of 5 mL, washed with 25 mL of water and equilibrated with 25
mL of loading buffer. The filtered cell extract is loaded onto the column at 0.5 mL per
minute. The column is washed to baseline A280 with loading buffer, at which point fraction
collection is started. Next, the column is washed with a secondary wash buffer (50 mM
phosphate; 300 mM NaCl, 10% Glycerol, pH 6.0), which elutes nonspecifically bound
protein. After reaching A280 baseline again, the column is developed with a 0 to 500 mM
Imidazole gradient in the secondary wash buffer. One mL fractions are collected and analyzed
by SDS-PAGE and silver staining or western blot with Ni2+-NTA-conjugated to alkaline
phosphatase (Qiagen). Fractions containing the eluted His10-tagged PRO874 polypeptide are
pooled and dialyzed against loading buffer.
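
To relate collected fractions to the imidazole gradient described above, the following sketch estimates the imidazole concentration at the midpoint of each 1 mL fraction. It assumes a strictly linear 0 to 500 mM gradient; the total gradient volume is not stated in the text, so the 50 mL used here is a hypothetical value, and the function name `imidazole_at_fraction` is an assumption made for the example.

```python
# Illustrative sketch: imidazole concentration across a linear gradient.
# Assumption: the gradient volume (50 mL) is hypothetical; the text gives only
# the 0 to 500 mM range and the 1 mL fraction size.

def imidazole_at_fraction(fraction_index, gradient_volume_ml,
                          fraction_volume_ml=1.0, start_mM=0.0, end_mM=500.0):
    """Return the imidazole concentration (mM) at the midpoint of a fraction.

    Fractions are numbered from 0, starting when the gradient begins.
    """
    midpoint_ml = (fraction_index + 0.5) * fraction_volume_ml
    if midpoint_ml < 0.0 or midpoint_ml > gradient_volume_ml:
        raise ValueError("fraction lies outside the gradient")
    return start_mM + (end_mM - start_mM) * midpoint_ml / gradient_volume_ml


gradient_volume_ml = 50.0  # hypothetical total gradient volume
for index in (0, 24, 49):
    conc = imidazole_at_fraction(index, gradient_volume_ml)
    print(f"fraction {index}: about {conc:.0f} mM imidazole")
```

Such an estimate only indicates where in the gradient a fraction was collected; which fractions actually contain the tagged polypeptide is determined by the SDS-PAGE or western blot analysis described above.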

Alternatively, purification of the IgG tagged (or Fc tagged) PRO874 polypeptide can
be performed using known chromatography techniques, including for instance, Protein A or
protein G column chromatography.

EXAMPLE 7: Preparation of Antibodies that Bind PRO874 Polypeptides

This example illustrates the preparation of monoclonal antibodies which can specifically
bind to PRO874 polypeptides.

Techniques for producing the monoclonal antibodies are known in the art and are
described, for instance, in Goding, supra. Immunogens that may be employed include purified
PRO874 polypeptide, fusion proteins containing a PRO874 polypeptide, and cells expressing
recombinant PRO874 polypeptide on the cell surface. Selection of the immunogen can be
made by the skilled artisan without undue experimentation.

Mice, such as Balb/c, are immunized with the PRO874 immunogen emulsified in
complete Freund's adjuvant and injected subcutaneously or intraperitoneally in an amount from
1-100 micrograms. Alternatively, the immunogen is emulsified in MPL-TDM adjuvant (Ribi
Immunochemical Research, Hamilton, MT) and injected into the animal's hind foot pads. The
immunized mice are then boosted 10 to 12 days later with additional immunogen emulsified
in the selected adjuvant. Thereafter, for several weeks, the mice may also be boosted with
additional immunization injections. Serum samples may be periodically obtained from the
mice by retro-orbital bleeding for testing in ELISA assays to detect anti-PRO874 polypeptide
antibodies.

After a suitable antibody titer has been detected, the animals "positive" for antibodies
can be injected with a final intravenous injection of PRO874 polypeptide. Three to four days
later, the mice are sacrificed and the spleen cells are harvested. The spleen cells are then
fused (using 35% polyethylene glycol) to a selected murine myeloma cell line such as
P3X63AgU.1, available from ATCC, No. CRL 1597. The fusions generate hybridoma cells
which can then be plated in 96 well tissue culture plates containing HAT (hypoxanthine,
aminopterin, and thymidine) medium to inhibit proliferation of non-fused cells, myeloma
hybrids, and spleen cell hybrids.

The hybridoma cells will be screened in an ELISA for reactivity against PRO874
polypeptide. Determination of "positive" hybridoma cells secreting the desired monoclonal
antibodies against a PRO874 polypeptide is within the skill in the art.

The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c
mice to produce ascites containing the anti-PRO874 polypeptide monoclonal antibodies.
Alternatively, the hybridoma cells can be grown in tissue culture flasks or roller bottles.
Purification of the monoclonal antibodies produced in the ascites can be accomplished using
ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively,
affinity chromatography based upon binding of antibody to protein A or protein G can be
employed.



The following materials have been deposited with the American Type Culture
Collection, 12301 Parklawn Drive, Rockville, MD, USA (ATCC):

Material          ATCC Dep. No.    Deposit Date
DNA40621-1440                      June 2, 1998



This deposit was made under the provisions of the Budapest Treaty on the International
Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure and the
Regulations thereunder (Budapest Treaty). This assures maintenance of a viable culture of the
deposit for 30 years from the date of deposit. The deposit will be made available by ATCC
under the terms of the Budapest Treaty, and subject to an agreement between Genentech, Inc.
and ATCC, which assures permanent and unrestricted availability of the progeny of the culture
of the deposit to the public upon issuance of the pertinent U.S. patent or upon laying open to
the public of any U.S. or foreign patent application, whichever comes first, and assures
availability of the progeny to one determined by the U.S. Commissioner of Patents and
Trademarks to be entitled thereto according to 35 USC §122 and the Commissioner's rules
pursuant thereto (including 37 CFR §1.14 with particular reference to 886 OG 638).

The assignee of the present application has agreed that if a culture of the materials on
deposit should die or be lost or destroyed when cultivated under suitable conditions, the
materials will be promptly replaced on notification with another of the same. Availability of
the deposited material is not to be construed as a license to practice the invention in
contravention of the rights granted under the authority of any government in accordance with
its patent laws.

The foregoing written specification is considered to be sufficient to enable one skilled
in the art to practice the invention. The present invention is not to be limited in scope by the
construct deposited, since the deposited embodiment is intended as a single illustration of
certain aspects of the invention and any constructs that are functionally equivalent are within
the scope of this invention. The deposit of material herein does not constitute an admission
that the written description herein contained is inadequate to enable the practice of any aspect
of the invention, including the best mode thereof, nor is it to be construed as limiting the scope
of the claims to the specific illustrations that it represents. Indeed, various modifications of
the invention in addition to those shown and described herein will become apparent to those
skilled in the art from the foregoing description and fall within the scope of the appended
claims.

WHAT IS CLAIMED IS:

1. Isolated nucleic acid comprising DNA having at least an 80% sequence identity
to (a) a DNA molecule encoding a PRO874 polypeptide comprising the sequence of amino acid
residues 1 to 321 of Figure 2 (SEQ ID NO:3), or (b) the complement of the DNA molecule
of (a).

2. The nucleic acid of Claim 1, wherein said DNA comprises the nucleotide
sequence of SEQ ID NO:1 or its complement.

3. The nucleic acid of Claim 1, wherein said DNA comprises nucleotides 1-963
of the nucleotide sequence of SEQ ID NO:1 (SEQ ID NO:2).

4. Isolated nucleic acid comprising DNA having at least an 80% sequence identity
to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein
cDNA in ATCC Deposit No. ______ (DNA40621-1440), or (b) the
complement of the DNA molecule of (a).

5. The nucleic acid of Claim 4 which comprises a DNA molecule encoding the
same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. ______
(DNA40621-1440).

6. Isolated nucleic acid comprising DNA having at least an 80% sequence identity
to (a) a DNA molecule encoding a PRO874 polypeptide comprising the sequence of amino acid
residues about X to 321 of Figure 2 (SEQ ID NO:3), or (b) the complement of the DNA
molecule of (a), wherein X is any one of amino acid residues 270 to 279 of Figure 2 (SEQ ID
NO:3).

7. A vector comprising the nucleic acid of any one of Claims 1 to 6. 

8. The vector of Claim 7 operably linked to control sequences recognized by a host
cell transformed with the vector.

9. A host cell comprising the vector of Claim 7.

10. The host cell of Claim 9, wherein said cell is a CHO cell.

11. The host cell of Claim 9, wherein said cell is an E. coli.

12. The host cell of Claim 9, wherein said cell is a yeast cell.

13. A process for producing a PRO874 polypeptide comprising culturing the host
cell of Claim 9 under conditions suitable for expression of said PRO874 polypeptide and
recovering said PRO874 polypeptide from the cell culture.

14. Isolated native sequence PRO874 polypeptide comprising amino acid residues
1 to 321 of Figure 2 (SEQ ID NO:3).

15. Isolated PRO874 polypeptide comprising amino acids X to 321 of the amino
acid sequence shown in Figure 2 (SEQ ID NO:3), wherein X is any one of amino acids 270
to 279 of Figure 2 (SEQ ID NO:3).

16. Isolated PRO874 polypeptide encoded by the cDNA insert of the vector
deposited as ATCC Accession No. ______ (DNA40621-1440).

17. A chimeric molecule comprising a PRO874 polypeptide fused to a heterologous
amino acid sequence.

18. The chimeric molecule of Claim 17, wherein said heterologous amino acid
sequence is an epitope tag sequence.

19. The chimeric molecule of Claim 17, wherein said heterologous amino acid
sequence is a Fc region of an immunoglobulin.

20. An antibody which specifically binds to a PRO874 polypeptide.

21. The antibody of Claim 20, wherein said antibody is a monoclonal antibody.

Abstract of the Disclosure
The present invention is directed to novel multi-span transmembrane polypeptides and 
to nucleic acid molecules encoding those polypeptides. Also provided herein are vectors and 
host cells comprising those nucleic acid sequences, chimeric polypeptide molecules comprising 
the polypeptides of the present invention fused to heterologous polypeptide sequences, 
antibodies which bind to the polypeptides of the present invention and to methods for 
producing the polypeptides of the present invention.

FIGURE 3A

GGGCCCCTGGTGCTGGTGTGATCCTGGGAGTGCTGGGCGTGCTGGCATAT 
, GCTGATCCTGGGAGTGCTGGGCGTGCTGGCATAT 

++++++++++++++++. .++++++++++++++++++++++++++++++++ 
. GGGCCCCTGGTGCTGGTGTGATCCTGGGAGTGCTGGGCGTGCTGGCATAT 

GGCATCTACTACTGCTGGGAGGAGTACCGAGTG-TGCGGGACAAGGGCGC 
; GGCATCTACTACTGCTGGGAGGAGTACCGAGTGCTGCGGGACAAGGGCGC 

++++++++ +++ +++ ++++++++++++++++++++++++++++++++■*■++* 
L GGCATCTACTACTGCTGGGAGGAGTACCGAGTGCTGCGQGACAAQGGCGC 

) CTCCATCTCCCAGCTGGGTTTCACCACCAAC - TCAGTGCCTACCAGAGCG 
^ CTCCATCTCCCAGCTGGGTTTCACCACCAACCTCAGTGCCTACCAGAGCG 
' '^^^^■^■^ CTCAGTGCCTACCAGAGCG 

++++++++++++++++++++++++++++++++++++++++++++++++++ 
L CTCCATCTCCCAGCTGGQTTTCACCACCAACCTCAGTGCCTACCAGAGCG 

TGCAGGAGACCTGGCTGGCCGCCCTGATCGTGTTGGCGGTGCTTGAAGCC 
TGCAGGAGACCTGGCTGGCCGCCCTGATCGTGTTGGCGGTGCTTGAAGCC 
TGCAGGAGACCTGGCTGGCCGCCCTGATCGTGTTGGCGGTGCTTQAAGCC 
TGGCTGGCCGCCCTGATCGTGTTGGCGGTGCTTGAAGCC 

AAGCC 

++++++++++++++++++++++++++++++++++++++++++++■'"'""'■+"'■ 
TGCAGGAGACCTGGCTGGCCGCCCTGATCGTGTTGGCGGTGCTTGAAGCG 

ATCCTGCTGCTGGTGCTCATCTTCCTGCGGCAGCGGA-TTCGTATTGCCA 
ATCCTGCTQCTGGTGCTCATCTTCCTGCGGCAN-GGA-TTCGTATTGCCA 
ATCCTGCTGCTGATGCTCATCTTCCTGCGGCA-CGGANTTCGTATTGC 
ATCCTGCTGCTGATGCTCATCTTCCTGCGGCAG- GGA- TTCGTATTGCCA 
ATNCTGCTGCTGATGNTNATNTTNCTGCGGCAG-GGA- TTCGTATTGCCA 
ATCCTGCTGCTGATGCrCATCTTCCTGCGGCAGCGGA-TTCGTATTGCCA 

TTGCCA 

++.+++++++++ ++.+.++.++.++++++++.++++ ++++++++++++ 
ATCCTGCTGCTGGTGCTCATCTTCCTGCGGCAGCGGA TTCGTATTGCCA 

TCGCCCTCCTGAAGGAGQCCAGCAAGGCTGTGGGACAQATGAT 

TCGCCCTCCTGAAGGAGGCCAGCAAGGCTGTGGGACAGATGATGTCTACC 
TCGGCCTNCTGAAGGAGGCCAGCAAGGCTGTGGGACAGATGATGTCTACC 
TCGCCCTCCTGAAGGAGGCCAGCAAGGCTGTGGGACAGATGATGTCTACC 
TCGCCCTCCTGAACGANGCCAGCAAGGCTGTGGGACAGATGATGTCTACC 
CTGAAGGAGGCCAGCAAGGCTGTGGGACAGATGATGTCTACC 
.++++++++++++++++++++ 



+ + + . + + + , + + + + + . ++ . + + + + + + + + + + + + + ++++ +++ + + + +++ + t-r-r-rTT 

250 TCGCCCTCCTGAAGGAGGCCAGCAAGGCTGTGGGACAGATGATGTCTACC 
138 ATGT 

104 ATGTTCTACCCACTGGTCAACTTTGTCCTNCTTCTCATCTGCATTGCCTA 
100 ATGTTCTACCCACTGGTCACCTTTGTCCTCCTCCTCATCTGCATTGCCTA 
57 ATGTTCTACCCACTGGTCACCTTTGTCCTCCTCCTCATCTGCATTGCCTA 
43 ATGTTCTACCCACTGGTCACCTTTGTCCTCCTCCTCATCTGCATTGCCTA 
1 CCTTTGTCCTCCTCCTCATCTGCATTGCCTA 
+++++++++++++++++++ . +++++++++ . ++ . +++++++++++++++++ 
300 ATGTTCTACCCACTGGTCACCTTTGTCCTCCTCCTCATCTGCATTGCCTA 

154 - CTGGGCCATGACTGCTCTGTANCTGGGTACATCGGGGCAACCCCAGTAT 
150 - CTGGGCCATGACTGCTCT-TACCTGGCTACATCGGGGCAACCCCAGTAT 
107 - CTGGGCCATGACTGCTCTGTACCTGGCTACATCGGGGCAACCCCAGTAT

FIGURE 3B

93 -CTGGGCCATGACTGCTCTGTACCTGGCTACATCGGGGCAACCCCAGTAT 
^2 ACTGGGCCATGACTGCTCTGTACCTGGCTACATCGGGGCAACCCCANTAT 

1 ctgSncacgagtgntctgtacctggntacatcggggcaaccccaot^^ 
J. uivauoiN cggctcgagcggggcaaccccagtat 
+++++ ++,++.++. ++++++ .+.++.+..+. +++++++++++++ . +++ 
350 ctgggccatgactgctctgtacctggctacatcggggcaaccccagtat 

203 gtgctctgggnatncaacatc-agnt-nccccggctgtgagaaagtgc 
198 gtgctctgggcatccaacatc-agct-cccccggctgtgaga 

156 gtgctctgggcatccaacatc-agct - cccccggctgtgagaaagtgcca 
142 gtgctctgggcatccaacatc-agct-cccccggctgtgagaaagtgcca 
8 2 gtgctctgggcatccaacatccagctncccccggctgtgagaaagtgcca 
50 gtgctctgggcatccaacatc-agct-cccccggctgtgagaaagtgcca 
27 gtgctctgggcatccaacatc-agct - cccccggctgtgagaaagtgcca 
++++++++++.++.+++++++ ++.+ ,++++++++++++++++++++++ 

399 GTGCTCTGGGCATCCAACATC AGCT CCCCCGGCTGTGAGAAAGTGCCA 

204 ATAAATACATCATGCAACCCCACGGCCCACCTT6TGAACTCCTCGTGCCC 
190 ATAAATACATCNTGCAACCCCACGGCCCACCTTGTGAACTCCTCGTGCCC 
132 ATAAATACATCATGCAACCCCACGGCCCACCTTGTGAACTCCTCGTGCCC 

98 -TAAATACATCATGCAACCCCACGGCCCACCTTGTGAACTCCTCGTGCCC 
7 5 ATAAATACATCATGCAACCCCACGGCCCACCTTGTGAACTCCTCGTGCCC 
1 ACATCATGCAACCCCACGGCCCACCTTGTGAACTCCTCGTGCCC 
X TGCAACCCCACGGCCCACCTTGTGAACTCCTCGTGCCC 
++++++++++++++++++++++++++ +++++••"*•+++++ 
.TAAATACATCATGCAACCCCACGGCCCACCTTGTGAACTCCTCGTGCCC 



AGGG- CTGATGTGCGTCTTCCAGGG- TACTCATCCAAAGGCCTAATCtJAA 
AGGG - CTGATGTGCGTCTTCCAGGGCTACTCATCCAAAGGCCTAATCCAA 
AGGG-CTGATGTGCGTCTTCCAGGGCTACTCATCCAAAGGCCTAATCCAA 
AGGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAGGCCTAATCCAA 
AGGG - CTGATGTGCGTCTTCCAGGGCTACTCATCCAAAGGCCTAATCCAA 
CTCATCCAAAGGCCTAATCCAA 
CAA 

* _ + + + 4- + + + + +++ + + + + + + + + +++ + + + + + +++ +++ + ++ + + + + + + + + + + + + 



CGTTCTGTCTTCAATCNGCAAATCTATGGGGTCCTGGGGCA 
CGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGAC 
CGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTQ 
CGTTCTGTCTTCAATCTGCAAATCTATGGNGTCCTGGGGCTCTTCTGGAC 
CGTTCTGTCTTCAATCTGCAAATCTATGGNGTCCTGGGGCTCTTCTGGAC 
CGTTCTGTCTTCAATCTGCAAATCTATGGNGTCCTGGGGCTCTTCTGGAC 
CGTTCTGTCTTCAATCTGCAAATCTATGGJSTGTCCTGGGGCTCTTCTGGAC 
ATGGGGTCCTGGGGCTCTTCTGGAC 
TNGGGCTCTTCTGGAC 
CTCTTCTGGAC 
CTCTTCTGGAC 

++++++++++++++++ . ++++++++++++ , +++++ . ++++ . +++++++++ 
547 CGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGAC 



835107 

3210594 

2264423
73 CCTTAACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTG

54 CCTTAACTGGGTACTGGCCCTGGGCCAATGCGTCCTCX3CTGGAGCCTTTG 

26 CCTTAACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGCAGCCTTTG 

17 CCTTAACTGGQTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTG 

12 CCTTAACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTG 

12 CCTTAACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTG 

1 ^ 
+++++++++++++++++++++++++++++++++++++++++•++++++++ 

597 CCTTAACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTG 

195 CCTCCTTCTACTGGGCCTTCCACAAGCCCCAGGACATCCCTACCTTCCCC 
188 CCTCCTTCTACTGGGCCTTCCACAAGCCCCAGGACATCCCTACCTTCCCG 
123 CCTCCTTGTACTGGGCCTTCCACAAGGCCCAGGACATCCCTACCTTCCCC 
104 CCTCCTTCTACTGGGCCTTCCACAAGCCCCAGGACATCCCTACCTTCCCC 
7 6 CCTCCTTGTACTGGGCCTTCCACAAGCCCCAGGACATCCCTACCTTCCCC 
6 7 CCTCCTNCTACTGGGCCTTCCACAAGCCCCAGGACATCCCTNCTTNCCCN 
6 2 CCTCCTTCTACTGGGCCTTCCACAAGCCCCAGGACATCCCTACCTTCCCC 
62 CCTCCTTCTACTGGGCCTTCCACAAGCGCCAGGACATCCCTACCTTCCCC 

2 CCTCCTTCTACTGGGCCTTCCACAAGCCCCAGGACATCCCTACCTTCCCC 
++++++, ++++++++++++++++++++++++++++++++++ . + • + • +++ • 

647 CCTCCTTCTACTGGGCCTTCCACAAGCCCCAGGACATCCCTACGTTCCCC 

245 TTAATCTCTGCCTTCATCCGCACACTCCGTNACCACACTGGGTCA 
238 TTAATCTCTGCCT 

173 TTAATCTCTGCCTTCATCCGCACACTCCGTTACCACACTAA 
154 TTAATCTCTGCCTTCATCCGCACACTCCGTTACCACACTGGGTCAT 
126 TTAATCTCTGCCTTCATCCGCACACTCCGTTACCACACTGGGTCATTGGC 
117 TAANTCTCTGCNTTCATCCGCACACTCCGTTACCACACTGGGTCATTGGC 
112 TTAATCTCTGCCTTCATCCGCACACTCCGTTACCACACTGGGTCATTGGC 
112 TTAATCTCTGCCTTCATCCGCACACTCCGTTACCACACTGGGTCATTGGC 
52 TTAATCTCTGCCTTCATCCGCACACTCCGTTACCACACTGGGTCATTGGC 
1 CACACTCCGTTACCACACTGGGTCATTGGC 
^ ATTGGC 
+ + 4,++++ + +.+++ +++++++++++++++,++++++++. .+++++++++ 
697 TTAATCTCTGCCTTCATCCGCACACTCCGTTACCACACTGGGTCATTGGC 

176 ATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTGG 
167 ATTTGGAGCCNTCATCCTGGACCTTGTGKNGATAGCCCGGGGTCG 
162 ATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTGG 
162 ATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTGG 
102 ATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTGG 
3 1 ATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTGG 
7 ATTTGGAGCNCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTGG 
1 GGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTGG 
1 CCTGACCCTTGTGCAGATAGCCCGGGTCATCTTGG 
1 GATAGCCCGGGTCATCTTGG 

+++++++++ . . ++++++++ . . +++++++ . . +++++++++++ +++++ 

74 7 ATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTGG 

226 AGTANATTGACCACAAG-TCAGAGGAGTGCAGAACCCTGTAGCCCGCTGC 

212 AGTAT 
212 AGTAT 

152 AGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGC 
8 1 AGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGC 
5 7 AGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGC 
4 7 AGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGC 

3 6 AGNATATTGACCACAAGCTCAGANGAGTGCAGANCCCTGTAGCCCGCTGC

FIGURE 3E

1 cCTGGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAA 
n CTGGACAAAGTCACAGACCTNCTGGTGTTCTTTGGGAA 
^ ACCTGCTGCTGTTCTTTGGGAA 
^X^^4. + + + + + + + + + + + ++ + + + ++.++..+ + +•+•++ + + + + + + + + 

997 agSgtggtcgtcctggacaaagtcacagacctgctgctgttctttgggaa 

184 gctgctggtggtcggaggcgtgggggtcctgtccttctttttttt-ctcc 
lit gSgctggtggtcngaggcgtgggggtcctgtoittctntntttttctcc 
III gctcctStgStSgaggcgtgggggtcctgtccttctttttttttctcc 

122 GC?^?S?GGTSi^GGCGTGGGGGTCCTGTCCTTCTTTTTTTT-CTC^ 
^06 GCTGCTGGTGGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTT-CTGC 

lot gS^SgSggtcggaggcgtgg^^^ 

10 gSgct?ctg?tcSSotgggggtcSot^ 

11 Sgctggtggtcggaggcgtgggggtcctgtccttctttttttttctc^ 
tl gSgctStggtcggaggcgtgggggtcctgtccttot^ 

39 gSnctggtggtnggaggcgtgggggtnctgtccttctttttttt-ctnc 

23 3SSS?GGTCGGAGGCGTGGG^^^^ 

} CGTGGGGGTCCTGTCCTTCTTTTTTTT - CTCC 

^ GTCCTTCTTTTTTTT - CTCC 

+++ ++ +++++++++ ++ ... + .• + • + •++++"'■■'■■'■■■'■ 
1047 SCTcSGSTCGicGGAGGCGTOGGGGTCCTGicCTTCTTTTTTTTTCTCC 

233 GGTCGCATCCCGGGGCTGGGTAAAGACTTTAAGAGCCCC 

171 ^?S?atcSggSctgggtaaagactttaagagcccccacctcaacta 
155 ggtcacatcccggggctgggtaaagactttaagagcccccacctcaacta 
153 SSSttccSnggctgggtaaagactttaagagcccccanctta^^^ 
115 ggtcgcatcccggggctgggtaaagactttaagagcccccacctcaacta 
99 S??gS?c?cg3gSgggtaaagactttaagagcccccacctcaact^ 
92 ggtcgcatcccggggctgggtaaagactttaagagcccccacctcaacta 
90 Sc^atccSgggctgggtaaagactttaagagcccccacctca^^ 
88 ggtcggatnccggggctgggtaaagactttaagagnccccanctnaacta 
7I ^?Satcccggggctgggtaaagactttaagagcccccacctcaacta 
3 4 ggtnggantccggggntgggtaaagactttaagagcccccacctcaacta 
32 ggtcgcatcccggggctgggtaaagactttaagagcccccacctcaacta 
20 ggtcgcatcccggggctgggtaaagactttaagagcccccacctcaacta 

t cggctcganggtctcgagtnacatncntnant 
* cccacctcaacta 

1097 GgJ CGCATCCCGGGGCTGGGTAAAGACT 

223 ttactggctgcccatcatgacctccatcctggggggcctatgtcatcgcc 
221 ttactggctgcccatcatgacctccatcctggggg-cctatgtcatcgcc 
205 SaSggctgcccatcatgacctccatcctggggg-cctatgtcatc 
202 ttactggctgaccatnangacctacaaactggngn-actatgtcatgaca 
165 Sactggctgcccatcatgacctccatcctggggg- cctatgtcatcgcc 
III SaSggSgc5St5atgacctccatcctggggg-cc 
142 ttactggctgcccatcatgacctccatcctggggg-cctatgtcatcgcc
140 TTACTGGNTGCCCNTCATGACCTNCATCCTGGGGG-
13 8 TTACTGGNTGCCCATNATGANCTNCATNCTGGGGG- 
121 TTACTGGCTGCCCATCATGACCTCCATCCTGGGGG- 
84 TTACTGGCTGCCCATCATGANCTCCATCCTGGGGG' 
82 TTACTGGCTGCCCATCATGACCTCCATCCTGGGGG- 
7 0 TTACTGGCTGCCCATCATGACCTCCATCCTGGGGG- 
44 TTACTGGCTGCCCATCATGACCTCCATCCTGGGGG- 
3 3 TCACTGGGTGCCCATCATGACCTCCATCCTGGGGG - 
14 TTACTGGCTGCCCATCATGACCTCCATCCTGGGGG- 
1 ACTNGCTGCCCATCATGACCTCCATCCTGGGGG- 
1 CCTCCATCCTGGGGG- 
1 ATCCTGGGGG- 



1147 TTACTGGCTGCCCATCATGACCTCCATCCTGGGGG CCTATGTCATCGCC

1196 AGCGGCTTCTTCAGCGTTTTCGG             ACACGCTCTT CCT

323 CTGCTTCCTGGGGAAGGACCTGGGGAGCGTTAAGAACGGGTTCCCTGGGG 
245 CTGCTTCCTGG- -AAG-ACCTGG- -AGCGG-A 
238 CTGCTTCCTGG- -AAG-ACCTGG- -A 

236 CTGCTTCCGGG- -A- -GACCTGG- - AGCGG -AACAANGGNT-T 
232 CTGCTTC-TGG- -GAG-ACCTGG-.-GGCGG-AA 

217 CTGCTTCCTGG- -AAG-ACCTaG--AGCGG-AACAACGGCT-CCCTaG- - 
180 CTGCTTNCTGG- -AAG-ANCTGG- - AGCGG-AACAACGGCT-TCCTGG- - 
178 CTGCTTCCTGG- -AAG-ACCTGG- -AGCGG-AACAACGGCT-CCCTGG- - 
166 CTGCTTCCTGG- -AAG-ACCTGG- -AGCGG-AACAACGGCT-CCCTGG- - 
140 CTGCTTCCTGG- -AAG-ACCTGG- -AGCGG-AACAACGGCT-CCCTGG- - 
110 CTGCTTCCTGG- -AAG-ACCTGG- -AGCGG-AACAACGGCT-CCCTGG- - 
95 CTGCTTCCTGG- -AAG-ACCTNG- -AGCGG-AACAACGGCT-CCCTGG- - 
76 CTGCTTCCTGG- -AAG-ACCTGG- -AGCGG-AACAACGGCT-CCCTGG- - 
72 CTGCTTCCTGG- -AAG-ACCTGG- -AGCGG-AACAANGGCT-CCCTGG- - 
31 CTGCTTCCTGG- -AAG-ACCTGG- -AGCGG-AACAACGGCT-CCCTGG- - 
2 7 CTGCTTCCTGG- -AAG-ACCTGG- - AGCGG -AACAACGGCT- CCCTGG- - 
14 GAAGTGAGGTT - -QGATGGCTGG- - AGCGG-AACAACGGCT- CCCTGG- - 
1243 CTGCTTCCTGG AAGGACCTGG AGCGG AACAACGGCT CCCTGG

ACCGGCCCTACTNACATGTTCCAAGAGCCTTTTTAAAGATTCTGGGGCAA 
ACCGGCCCTACT ACATGT CCAAGAGCCTTCT AAAGATTCTGGG CAA



4 23 GAAGAACGAGGGCGCCCCCGGACAACAAGAAGAGG 

2 65 GAAGAACGAGG - CGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAG-TC 
227 G AAG AAC 

197 GAAGAACGAGG - CGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAG - TC 
182 GAAGAACGAGG - CGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTC 
163 GAAGAACGAGG- CGCCCCCGGACAACAAGA 

159 GAAGAACGAGG-NGNCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTC 
118 GAAGAACGAGG- CGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTC 
114 GAAGAACGAGG - CGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTC 
102 GAAGAACGAGG - CGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTC 
3 1 GAAGAACGAGG - CGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTC 
2 5 GAAGAACGAGG - CGCCCCCGGAC AAC AAGAAGAGGAAGAAGTGACAGCTC 
1 7 GAAGAACGAGG - CGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTC 
14 GAAGAACGAGG- CGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTC 
9 CTAGAACGAGG- CGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTC 
1 AACGAGG - CGCCCCCGGAC AAC AAGAAGAGGAAGAAGTGACAGCNC 

1 AGAAGAGGAAGAAGTGACAGCTC 
, GGAAGAAGTGACAGCTC 
j[ GTGACAGCTC 
+++++++++ , + . +++++++++++++++-n-+++++++-f +++++■*•+ + . + 
13 31 GAAGAACGAGG CGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTC 

313 CGGCCCTGATTCA 
245 CGGCCC 

231 CGGCCCTGATCCAGGACTGCACCCCACCCNCACCGTCCAGCCATCCAACC 
208 CGGCCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACC 
167 CGGCCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACC 
163 CGGCCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACC 
151 CGGCCCTGATCCAGGACTGCACCCCACC 
8 0 CGGCCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACC 
74 CGGCCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACC 
6 6 CGGCCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACC 
6 3 CGGCCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACC 
5 8 CGGCCCTGATCC AGGACTGCACCCCACCCCCACCaTCCAGCCATCCAACC 
4 6 CGGCCCTGNNCCAGGACNGCNCCCCACNCCCACCGTCCAGCCATCNAACC 
2 4 CGNNCCTNATCCAGGACTNCACCCCACCCNCACCGTNCAGCCATCCAACC 
1 8 CGGCCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACC 
1 1 CGGCCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACC 
1 CANGGACGTGCACCCCACCCCCACCGTCCAGCCATCCAACC 
1 CAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACC 
1 CAGCCATCCAACC 

++, .+++ + + ,++++++, + .++++++.++++++++.++++ 

13 80 CGGCCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACC 
TKACTTNGCCTTACAGGTCTCCATTTTGTGGTAAAAAAAGGTTTTAGGCC 

TNACT 

TCACTTC 

TCACTTCGCCTTACAGGTCTCCATTTTGTGGTAAAAAAAGGTTTTAGGCC 
TCACTTCG G CTT ACAGGTCTCC ATTTTGTGGTAAAAAAAGGTTTTA 
TCACTTCGCCTTACAGGTCTCCATTTTGTGGTAAAAAAAGGTTTTA 

tcacttcgccttacaggtctccattttgtggtaaaaaaaggtttta 

tcacttcgccttacaggtctccattttgtggtaaaaaaaggtttta 

tcacttcgccttacaggtctccattttgtggtaaaaaaaggtttta 

tcacttcgccttacaggtctccantttgtggtaaaaanaggtttta 

tcacttcgncttacaggtctccattttgtggtaaaaaaaggtttttg 

tcacttcgccttacaggtctccattttgtggtaaaaaaaggtttta 

tcacttcgccttacaggtctccattttgtggtaaaaaaaggtttta 

TCACTTCGCCTTACAGGTCTCCATTTTNTGGTAAAAAAAGGTTTTAGGCC 

TCACTTCGCCTTACAGGTCTCCATTTTGTGGTAAAAAAAGGTTTTA 

TCACTTCGCCTTACAGGTCTCCATTTTGTGGTAAAAAAAGGTTTTA 

+ . ++++ , + . ++++++++++++++ , +++ , +++++++++ . +++++++ . ++++ 

TCACTTCGCCTTACAGGTCTCCATTTTGTGGTAAAAAAAGGTTTTAGGCC 



331 AGGGGNCGTGGCTTCACGCCTGTTAATTCAACACTTTGAGAGGNTTGAGG 
263 AGGCGCCGTGGCT - CACGCCTGT - AATCCAACACTTTGAGAGGCT- GAGG 
9 2 AGGCGCCGTGGCT - CACGCCTGT -AATCCAAC 

+++.+.+++++++ +++++++++ +++.+++++++++++++++.+ ++++ 
14 80 AGGCGCCGTGGCT CACGCCTGT AATCCAACACTTTGAGAGGCT GAGG 

381 CNGGCGGATTCAANTTGAGTTCAGGAGTTTCGAGANCAGTTGGGCCAACA 
310 CGGGCGGAT - CACCT - GAGT - CAGG AGTT - CGAGACC AGCCTGGCC AACA 
+ .+++++++ ++..+ ++++ ++++++++ +++++.+++...++•+ +++++ 
1527 CGGGCGGAT CACCT GAGT CAGGAGTT CGAGACCAGCCTGGCCAACA 

431 TTGTTGAAACTT 

356 TGGT-GAAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGG 
+ . ++ +++++ . +++++++++++++++++++++++++++++++++++++++ 
1573 TGGT GAAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGG 

405 TGGCATGCACTGTCATCCAGCTACTCGGGAGGCTGAG 

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +++ 

1622 TGGCATGCACTGTCATCCAGCTACTCGGGAGGCTGAG 